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1. FILTERS 

For over three hundred years, a basic question about the calculus remained unanswered. Do 
the infinitesimals, as conceptional understood by Leibniz and Newton, exist as formal mathematical 
objects? This question was answer affirmatively by Robinson (1961) and the subject termed "Non- 
standard Analysis" (Robinson, 1966) was introduced to the scientific world. As part of this book, 
the mathematical existence of the infinitesimals is established and their properties investigated and 
applied to basic real analysis notions. 

I intend to write this "book" informally and it's certainly about time that technical books be 
presented in a more "friendly" style. What I'm not going to do is to present an introduction filled 
with various historical facts and self-serving statements; statements that indicate what an enormous 
advancement in mathematics has been achieved by the use of Nonstandard Analysis. Rather, let's 
proceed directly to this simplified approach, an approach that's correct but an approach that cannot 
be used to analysis certain areas of mathematics that arc not classified as elementary in character. 
These areas can be analysis but it requires one to consider additional specialized mathematical 
objects. Such specialized objects need only be considered after an individual becomes accustomed to 
the basic methods used within this simple approach. There are numerous exciting and thrilling new 
concepts and results that cannot be presented using the simple approach discussed. The "internal" 
objects, objects that "bound" sets that represent "concurrent" relations and saturated models are 
for your future consideration. The main goal is to present some of the basic nonstandard results 
that can be obtained without investigating such specialized objects. 

I'll present a complete "Proof" for a stated result. However, one only needs to have confidence 
that the stated "Theorem" has been acceptably established. Indeed, if you simply are interested 
in how these results parallel the original notions of the "infinitesimal" and the like, you need not 
bother to read the proofs at all. 

There's an immediate need for a few set-theoretic notions. We let IN be the set of all natu- 
ral numbers, which includes the zero as the first one. I assume that you understand some basic 
set-theoretic notation. Further, throughout this first chapter, X will always denote a 
nonempty set. Recall that for a given set X the set of all subsets of X exists and is called the 
power set. It's usually denoted by the symbol V{X). For example, let X = {0, 1, 2}. The power 
set of X contains 8 sets. In particular, V(X) = {0, X, {0}, {1}, {2}, {0, 1}, {0, 2}, {1, 2}}, where 
denotes the empty set, which can be thought of as a set which contains "no members." 

Definition 1.1. (The Filter.) (The symbol C means "subset" and includes the possible 
equality of sets.) A (nonempty) ^ T C V{X) is called a (proper) filter on X if and only if 

(i) for each A, B e J 7 , A n B € T\ 

(ii) if A C B C X and A e J 7 , then Bef; 

(iii) i T. 

Example 1.2. (i) Let ^ A c X. Then [A] | is the set of all subsets of X that contain A, or, 
more formally, [A] 1= {x \ x C X and A c x}, is a filter on X called the principal filter. 

How to properly define what one means by a "finite" set has a long history. But as Suppes 
states "The common sense notion is that a set is finite just when it has "m" members for some 
non-negative integer m. [You can use our set M to get such an "m."] This common sense idea is 
technically sound . . . ." (1960, p. 98). The notion of finite can also be related to constants 
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that name members of a set and the formal expression that characterizes such a set in terms of the 
symbols "=" and "V" (i.e. "or"). Notice that the empty set is a "finite set." This association to the 
natural numbers is denoted by subscripts that actually represent the range values of a function. I 
also assume certain elementary properties of the finite sets and those sets that are not finite. For 
X, let 7^ B C V{X) have the finite intersection property. This means that each Bi £ B is 
nonempty and that the "intersection" of all of the members of any other finite subset of B is not 
the empty set. Given such aB / V{X), then we can generate the "smallest" filter that contains B. 
This is done by first letting B' be the set of all subsets of V{X) formed by taking the intersection 
of each nonempty finite subset of B, where the intersection of the members of a set that contains 
but one member is that one member. Let's consider some basic mathematical abbreviations. The 
formal symbol A means "and" and the formal "quantifier" 3 means "there exists such and such." 
Now take a member B of B' and build a set composed of all subsets of X that contain B. Now do 
this for all members of B' and gather them together in a set to get the set (B). Hence, {B} is the 
set of all subsets x of X such that there exists some set B such that B is a member of B 1 and B is 
a subset of x. Formally, (B) ={i|(iCl)A (3-B((B e B') A (B C x))}. I have "forced" (B) to 
contain B and to have the necessary properties that makes it a filter. 

There exists a very significant "filter" C on infinite X defined by the notion of not being finite. 
This object, once we have shown that it is a filter on X, is called the cofinite filter. [Cofinite means 
that the relative complement is a finite set]. 

Definition 1.3. (QED and iff.) Let C = {x | (x C X) A (X - x) is finite)}. Also I will 
use the symbol | for the statement "QED," which indicates the end of the proof. Then "iff" is an 
abbreviation for the phrase "if and only if." 

Theorem 1.4. The set C is a filter on infinite X and, the intersection of all members of C, 
f){F\F eC} = ®. 

Proof. Since I^f), then there is some a e X. Further, X — (X — {a}) = {a} implies that C ^ 0. 
So, assume that A,B eC. Then since X - (A n B) = (X - A) U (X - B), X - (A n B) is a finite 
subset of X. Thus, since X - (X - (A n B)) = A n B, then A n B e C. Now suppose A C C C X. 
Then X - C C X - A. Thus, because X - A is finite, then X - C is finite. Hence, C e C. Also, 
since X is infinite, then X — = X implies that ^ C. Consequently, C is a filter on X. 

Now observe that K = X - f]{F \ F e C} = \J{X - F \ F e C} = X, for if a e X, then 
X-{a} eC implies that X - (X - {a}) = {a} C K. Hence, we must have that n{F | F e C} = 0. | 

I mention that C is also called the Frechet filter, ft turns out that we are mostly interested in 
a maximum filter that contains C. 

Definition 1.5. (Ultrafilter.) A filter U on X is called an ultrafilter iff whenever there's a 
filter J 7 on X such that U C T, then U = T . 

Prior to showing that ultrafilters exist, let's see if they have any additional useful properties. 

Theorem 1.6. Suppose that U is an ultrafilter on X . If A(J B eW, then A eM or B eW. 

Proof. Let A £ U and B $ U but A U B e U. Let Q = {x | (x C X) A (A U x e U}. We show 
that Q is a filter on X. [Note: We now begin to use variables such as x,y, z etc. as mathematical 
variables representing members of sets. These symbols are used in two context, however. The other 
context is as a variable in our formal logical expressions.] Let x, y, e Q. Then iUi, A U y e U. 
Hence, (A U x) n (A U y) = A U (x n y) G hi implies that x U y <G Q. Now suppose that x £ Q and 
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iCjCl Then AUiCiUt/ implies that AUy eU. Hence, y £ Q. Also A U £ U implies that 
£ Q. Thus Q is a filter on A. 

Let C eU. Then C C AuC £U implies that Ce^. Therefore, U cQ. But, B £ Q implies 
that U ^Q. This contradicts the maximum aspect for | 

Theorem 1.7. Let J 7 be a filter on X. Then T is an ultrafilter iff for each A C X , either 
AeU or X - AeU, not both. 

Proof. Assume J 7 is a ultrafilter. Then X = X U (X — A) implies that either A £ U or 
X - AeU. Both A and X - A cannot be members of U, for if they were then ^4 n (A - A) = eW; 
a contradiction. 

Conversely, suppose that for each A C A, either ieforl-Aef. Let Q be filter on A 
such that T C </. Let A £ Q- Then X - A £ Q since £ is a filter. Thus, X- A^T. Hence, A £ T. 
Thus implies that Q=T.\ 

Given any filter T on A a major question is whether there exists an ultrafilter U on A such 
that F C U The answer to this question can take on, at least, two forms. The next result states 
that such ultrafilters always exist. The proof in the appendix uses a result, Zorn's Lemma, that is 
equivalent to the Axiom of Choice. The Axiom of Choice, although it's consistent with the other 
axioms of set theory, may not be "liked" by some. There's an axiom that is also consistent with the 
other usual axioms of set theory that is weaker than the Axiom of Choice. What it states is that 
such an ultrafilter always exits. So, you can take your pick. 

Theorem 1.8. Let T be a filter on X. Then there exists an ultrafilterU on X such that T CU. 
Proof. See the appcndix.| 

A natural study is to see if we can partition the set of all ultrafilters defined on A into different 
categories. And, why don't we use the symbol Fx [resp. Ux] to always denote a filter [resp. 
ultrafilter] on A. It turns out there are two basic types of Ux, the principal ones and those that 
contain Cx ■ 

Theorem 1.9. Let p £ A. Then [p] | is anUx- 

Proof. Let nonempty A a X. Then either p € A or p e (A — A) and not both. Thus A € [p] | 
or (A — A) E [p] t • Hence, by Theorem 1.3, [p] f is an Ux- 

Theorem 1.10. Assume thatUx is not a principal ultrafilter. Then Cx C Ux- 

Proof. Let arbitrary nonempty finite {po, . . . ,pk} C A. Since Ux is non-principal, then Ux ^ 
[pi] |, i = 0, . . . , k. Hence, for each i = 0, . . . , k there exists some Ai C A such that Ai £ Ux and 
Pi £ Ai. For, otherwise, if pi £ Ai for any Ai £ Ux, then [pi] fc Ux (they are =.) Consequently, 
{p , - - - ,Pk} n (A n • • • n A k ) = 0. However, (A n • • • D A k ) £U X - Therefore, {p , - - - ,Pk} $ U X - 
Theorem 1.3 implies that A — {p , - - - ,Pk} & Ux- Thus, Cx C Ux- I 

Non-principal ultrafilters are also called free ultrafilters. This comes from Theorems 1.4 and 
1.10 which imply that Ux is free iff f]{F \ F £ Ux} = 0- Also, another characterization is that Ux 
is free iff there does not exist a nonempty finite F C A such that F £ Ux- If we let A = IN, then 
there are a lot of free Ux- Unless otherwise stated, the free ultrafilter that's used will not affect any 
of the stated results. 
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2. A SIMPLE NONSTANDARD MODEL FOR ANALYSIS 

We let IR denote the real numbers. The set IR uses various operators and relations to obtain 
results within analysis. For this simplified approach, most of what we need is defined from the basic 
addition +, multiplication •, total order < properties and few other ones accorded IR. For convenience, 
I denote this fact by the structure notation (IR, +,-,<, where the $i are any other relations 
one might consider for IR whether definable from the basic relations or not. Further, it is always 
understood that each structure includes the = relation, which is but set-theoretic equality or identity 
for members of TR. I'll have a little more to write about how these "numbers" should be viewed 
later. But, first to our construction. Let IR N represent the set of all sequences with domain 
M and range values (images) in TR. Of course, sequences are functions, (maps, mappings, etc.) 
that are often displayed as a type of "ordered" set in the form {so, Si, S2, . . .}. You can define binary 
operators + and •, among others, for sequences by simply taking any two /, g G TR N and defining 
/ + g = h to be the sequence h where the values of h are h(n) — f(n) + g(n) and / • g = fg = k 
to be the sequence k where the values of k are k(n) = f(n)g(n) for each net This forms, at the 
very least, what is called a ring with unity. What I'll do later is to show that there's a subset of 
TR N that "behaviors" like the real numbers, with respect to the defined relations, and we'll us this 
subset as if it is the real numbers. In all the follows, U = will always be a free ultrafiltcr and 
the symbol U is used to represent members of U. Now to make things symbolically simple capital 
letters from the beginning of the alphabet A,B,C,... will always denote members of TR N . 
Also, we usually use the subscript notation for the images. Now let us begin our construction of a 
nonstandard model for real analysis. 

Definition 2.1. (Equality in U) Let A, B e TR N . Define A = u B iff {n \ A n = B n } = U eU. 

(The set of all IN such that the values of the sequences A and B are equal.) 

It has been said that the most important binary relation within mathematics is the equivalence 
relation. This relation, in general, behaves like = except that you may not be allowed to "substitute" 
one equivalent object for another. Recall that for a set X a binary relation R is an equivalence 
relation on X iff it has the following properties. For each x,y, z G X, (i) xRx (reflexive property); 
(ii) if xRy, then yRx (symmetric property); [Note that if this holds, then xRy iff yRx] (iii) if xRy 
and yRz, then xRz (transitive property). Hence, it is almost an "equality." 

Theorem 2.2. The relation =u is an equivalence relation on IR N . 

Proof. Of course, properties of the = for members of IR are used. First, notice that {n \ A n = 
A n } = IN G U for any A G B N . Thus, the relation is reflexive. 

Clearly, for any A, B G IR, if {n \ A n = B n } G U, then {n \ B n = A n } G U. 

Finally, suppose that A,B,Ce IR n and A =u B and B =u C. Hence, {n \ A n — B n } G U and 
{n | B n — C n } G U. The word "and" implies, since U is a filter, that {n \ A n = B n } n {n | B n = 
C n } G U. Of course, this "intersection" need not give all the values of IN that these three sequences 
have in common, but that does not matter since the "superset" property for a filter implies from 
the result 

{n | A n = B n } n {n \ B n = C n } C {n \ A n = C n } 

that {n | A„, = C n } eU.% 

[Note: In the above "proof," the two step process of getting the common members by the 
"intersection" and using the superset property is a major proof method.] 
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Definition 2.3. (Equivalence classes.) We now use the relation =u to define actual subsets 
of Il N . For each A £ m N , let the set [A] = {x \ (x £ m N ) A (a; = u A)}. 

It is easy to show that for each A,Bel N , either [A] = [B] or [A] n [B] = 0. (The "=" here 
is the set-theoretic equality.) Further, TR N = \J{[x] \ x £ TR N }. That is the set M N is completely 
partitioned (separated into, broken up into) these non-overlapping nonempty sets. Because of 
these properties, we can use any member of the set [A] to generate the set. That is if B, C £ [A], 
then [A] — [B] — [C]. As to notation, when I'm not particular interested in a sequence that generates 
the equivalence class, I'll denote them by lower case letters a, b, c, . . .. 

Denote the set of all of these equivalence classes by * IR and call this set the set of all hyperreal 
numbers. (The * is often translated as "hyper.") Consequently, *TR = {[A] \ A £ M N }. After 
various relations are defined on *TR, the resulting "structure" is generally called an ultrapower. 
Indeed, it's this ultrapower that will act as our nonstandard model for portions of real analysis. 
There's still a lot of work to do to turn *]R into a such a model, but to motive this work I'll simply 
mention that if you take a sequence s that converges in the normal calculus sense to 0, then [s] is one 
of our infinitesimals. What will be done, after the ultrapower model is constructed, is to "embed" 
(IR, +, •, <, $,) into the ultrapower so that comparisons can be easily made between the "standard" 
objects that represent the properties of the actual real numbers and other objects in the ultrapower. 
The notation (IR, +,-,<, <3>i) identifies the carrier, IR, as well as certain specialized relations defined 
for (on) the carrier. 

There are two approaches to analyze this ultrapower, a direct and tedious method, and a method 
that uses notions from Mathematical Logic. Once everything is constructed and the embedding is 
secured, then the embedded objects become our standard objects. The set of nonstandard objects 
is the remainder of the ultrapower. 

Definition 2.4. (Addition and multiplication for the *IR.) Consider any a,b,c £ *IR. 
Define a *+ b = c iff {n \ A n + B n = C n } £ U. [Note: such definitions assume that you have selected 
some sequences A n £ a, B n £ b, C n £ c. Now define a* b = c iff {n | (A n ) ■ (B n ) = C n } £ U. 

Whenever such definitions are made by taking members of a set that contains more than one 
member it is always necessary to show that they are well-defined in that the result is not dependent 
upon the member one chooses. The next result shows how this is done and gives insight as to how 
it will be done later in completely generality. 

Theorem 2.5. The operations defined in definition 2.4 are well-defined. 

Proof. Let [A], [D] £ a, [B], [F] £ b. Notice that {n | A n = D n } £ U} and {n \ B n = F n } £ U 
implies that {n | A n = D n } n {n | B n = F n } £ U and {n \ A n = D n } n {n | B n = F n } C {n | 
A n + B n = D n + F n } implies by the superset property that {n | A n + B n = D n + F n } £ U. Thus 
the *+ is well-defined. (Note: Processes of this type that use filter properties that imply something 
is a member of a filter will be abbreviated.) In like manner, for the *• . | 

Thus far, the fact the U is an ultrafilter has not been used. But, for the structure (*IR, *+, *•) to 
have all the necessary mathematical "field" properties, this ultrafilter property is significant. That 
is so that the *+, *• arithmetic behaves for *H, like +, • behave for real number arithmetic. 
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Theorem 2.6. For the structure (*TR, *+, *•) 

(i) [0] is the additive identity; 

(ii) for each a = [A] G *2R, — a = [— A] is the additive inverse; 

(iii) [1] is the multiplicative identity; 

(iv) If a ^ [0] , then there exists b — [B] G * E such that a*b — [1] . 

(v) For each n G M if D n = A n +B n and E n = A n B n , then [A] *+[B] = [D], [A] *-[B] = [E]. 
That is our definitions for addition and multiplication of sequences and the hyper- operators *+, *■ 



are compatible. 

Proof, (i) Let [A] *+ [0] = [C]. Considering that {n \ A n + 0„ = C„} G U and {n \ A n + 0„ = 
C n } C {n | A n = C„} G W, then [A] = [C]. 

(ii) Let [-A] = [B]. Then once again {n \ A n + {-A n ) = = 0„} = U G W and thus 
[A]*+[-A] = [0]. 

(iii) This follows in the same manner as (i). 

(iv) Let [A] ^ [0]. Then {n | A n = = 0„} = U £U. Hence, BJ - [/ = {n | A n 7^ 0} G W since 
W is an ultrafilter. Define 



Notice that {n \ A n ■ B n = 1 = 1„} = {n \ A n ^ 0} G Hence, [A] *• [S] = [1]. 

(v) By definition, [A] + [B] = [C] iff {n \ A n + B n = C n } G U. However, {n \ A n + B n = D n } = 
meU. Hence, {n \ A n + B n = C n } n {n \ A n + B n = D n } = {n \ C n = D n } G «}. Thus, [C] = [£>]. 
In like manner, the result holds for "multiplication."! 

Clearly, one can continue Theorem 2.6 and show that *+, *•) satisfies all of the "field" 
axioms. It should be obvious, by now, how the "order" relation for *K is defined. 

Definition 2.7 ( Order) For each a = [A], b = [B] G *B define a *<b iff {n | A n < B n } G U. 

I won't show that this relation is well-defined at this time since I'll do it later for all such 
relations. But, we might as well show that this *< is, indeed, a total order and for (*M, *+, *, *<) 
as a binary relation only *< behaves like the < behaves for TR. 

Theorem 2.8. The structure (*TR, *+, *, *<) is a totally ordered field. 
Proof. First, notice that {n \ A n < A n } = M G U. Thus, *< is reflexive. 

Next, this relation needs to be anti-symmetric. So, assume that [A] *<[£?], [B]*<L4]. Then 
{n I A n < B n } n {n \ B n < A n } C {n | A n = B n } G U. Hence, [A] = [B\. 

For transitivity, consider [A] *<[B], [B] *<[C]. Then {n \ A n < B n } n {n \ B n < C' n } C {n \ 
A n < C n } G U. Thus, [A] *<[C]. (Notice that the same processes seem to be used each time. They 
are that U is closed under finite intersection and supersets.) 

Next to the notion of "totally." Let [A] , [B] G *TR. Suppose that [A] Thus from the 

trichotomy law for m, {n \ A n > B n } G U. Hence, L4]*>L3] or [A] *<[B] or [A] = [B]. To show 
that it is a totally ordered "field" all that's really needed is to show that it satisfies two properties 
related to this order and the *+, *• operators. So, let [A], [B], [C] G *m. Let [A] *< [B]. Then 
{n I A n < B n } C {n\ An + Cn < B n + C n } G U. Thus [A] *+ [C] *<[B] *+ [C]. Now suppose that 
[0] *< [A], [B]. Then {n \ < A n } U {n \ < B n } C {n | < A n B n } G U. \ 
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By the way, using repeatedly the ultrafilter properties to establish the above results is actually 
unnecessary when a more general result from Mathematical Logic is used. It's this general result 
that gives me complete confidence that these theorems can be established directly. Indeed, the 
very definition for the * operators comes from this more powerful approach. A major one of these 
Mathematical Logic results I'll introduce shortly. 

There is often introduced into this subject certain concepts from abstract algebra and abstract 
model theory. I've decided to avoid this as much as possible for this simplified version. But, now 
and then, I need to simply state that something holds due to results from these two areas and you 
need to have confidence that such statements are fact. 

What happens next is to "embed" the structure (IR, +,-,<) into *+, *, *<) so that the 
relations +,-,< can be considered as but the relations *+, *, *< restricted to IR. All one does is to 
define a function / that takes each x G IR and gives the unique [R], where {n \ R n = x} e U. Notice 
that one such representation for [R] is the sequence X n = x for each n € I. Then {n \ X n = x} = 
This is called the constant sequence representation for x in *TR. This function determines 
what is called a model theoretic isomorphism when the relations *+, *, *< are restricted to 
the [X] and is what is used to embed (IR, +, •, <) into (*IR, *+, *, *<). One of the big results from 
abstract model theory states that if one expresses the properties of (IR, +, ■, <) in the customary 
mathematicians' way (as a first-order predict statement with constants), then every theorem that 
holds true in (IR, +,-,<) will hold true when interpreted within this embedding. It's important 
to note that the real numbers IR are constructed within our basic set theory. Hence, the object 
IR has a lot of properties. It's assumed that all such properties that can be properly expressed 
using our present or future defined operations or relations also hold for the structure (IR, +, ■, <). 
Thus, simply consider (IR, +, ■, <) as a piece (a substructure) of the structure (*]R, *+, *, *<). Under 
this embedding, the notation can be simplified somewhat, by dropping the * from the relations 
*+, *, *< always keeping in mind that the structure (IR, +, •, <) is formed by simply restricting these 
relations to members of the embedded IR. As mentioned each object with which we work and that 
becomes part of this embedding will be called a standard object. All other objects discussed are 
nonstandard objects. 

At this point, I could go onto some abstract algebra and show without any doubt that the 
structures (IR, +,-,<) and (*IR, +,-,<), although they are both totally order fields, are not the 
same. But, let's just show that there is a property that (IR, +, ■, <) has that (*IR, +, ■, <) does not 
have. 

Theorem 2.9. A field property holds for (TR, +, ■, <) that does not hold for (*TR, +, ■, <). 

Proof. There is a property of (IR, +,-,<) that states that for each < r e TR there exists 
an n £ IN such that r < n. Now the set IN is a subset of IR and in the embedded form (not yet 
introduced) IN C *IR. Consider the sequence A n = n. Then [A] <E *TR. The ultrafilter U is free and 
does not contain any finite sets. Thus, for each m e M, {n \ A n < m} U. Hence, {n \ A n > m} e U. 
This means that [A] > [M] = m. Since m is arbitrary, then [A] > [M], for each m € IN. Hence, at 
least for the ordinary embedded IN, this field property for (IR, + , •, <) does not hold for (*IR, +, •, <). 
| For those that understand the terminology, the field *IR is also not complete. 

From our definition, the A n = n used to establish Theorem 2.9 would be a nonstandard object. 
Now let's add a vast number of additional relations $i to our structures. This will allow us to apply 
these notions to analysis. The next idea is to "carve out" from our set theory some of the important 
set-theoretic objects used throughout nonstandard analysis. 



12 



Nonstandard Analysis Simplified 



Definition 2.10. ( Hyper (*) Extensions of standard objects.) LetU be a free ultrafilter. 
For any C C IR (a 1-ary relation), let b = [B] £ *C, iff {n \ B n £ C'} £ U. Let $ be any k-ary 
(k > 1) relation. Then (a 1; ...,a fe ) = ([Ai], ■ ■■ , [A k ]) £ *$ iff {n \ (A^n), . . . , A k {n)) 
This extension process can be continued for other mathematical entities as required. 

Now if it's shown, in general, that these definitions are well-defined, then we can add to our 
structure additional n-ary relations and get the structures (TR, +, •, <, $j) and (*1R, +, •, <, 
In which case, as before, we would have that $ C because all of these relations are actually 
defined in terms of members taken from IR. 

Theorem 2.11. The hyper- extensions defined in 2.10 are well-defined. 

Proof. In general, for any [B] € * IR, let [B] = [B 1 ] . That is let B' € IR N be any other member 
of the equivalence class. Let Cel. Then 

{n\B n = B'J C {n \ (B n £ C) if and only if (B' n £ C)}, 

{n | B n £ C} n {n | (B n £ C) if and only if (B' n G C)} C {n \ B' n e C}, 

{n | B' n eC}n{n\ (B n e C) if and only if (B' n e C)} C {n \ B n S C}. 

The result for this case follows. 

For the other k-ary relations, proceed as just done but alter the proof by starting with 

{n | = B[(n)} n • • • D {n | Bi(n) = Bi(n)} C 

{n | (B^n), . . . B fc (n)) G $ if and only if {B[(n), . . . B' k (n)) e $}. 
This completes the proof. | 

Definition 2.12. (Standard objects operator <T .) I'm using symbols such as x,y,z,w to 

represent members of IR or for n > 1 as members of IR™ = IR x • • • x IR, with "n" factors. Later, 
the Roman font for "variables" in formal expressions is used. For each x e IR, let *x — [X] £ *TR, 
where {n \ X n = x} = IN (the constant sequence). Then for X C IR, let a X = { *x \ x e X} C *TR. 
For n > 1 and each x = [xi,...,x n ) e IR", let *x = [*xi,..., *x n ) e *(lR n ). For X C IR", 
"I = {*i|iel}c *(IR"). Each such *x and 'X is called a standard object. Thus, CT IR is the 
set of embedded real numbers. 

What Definition 2.12 docs is to identify within (*TR, +, •, <, the embedded (IR, +, •, <, $,) 
objects. For this structure, it's significant that not all useful objects can be hyper-extended by the 
above, actually necessary, ultrafilter defined extension process. Indeed, because we are only using 
sequences with range values in IR, various members of V(V(TR)) cannot be extended. Further, there's 
a problem if the membership relation G is extended. Nonstandard analysis exists as a discipline only 
because the structures (IR, +,-,<, and (*B, *+, *, *<, can be analyzed externally since 

they exist as objects in the model of the set theory being used for their construction. In formal set 
theory as it might appear in Jech (1971), you find that the natural numbers have the property that 
0Gle2e3e4-- - and n ^ n. The € relation is said to be well founded because there are no 
types of sequences of members of this set theory that have this processed reversed. There are no 
objects such that ■ • • a £ b £ c £ d. If, however, the £ is extended to *<G for members of IR, then this 
*G is not well founded. 
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Example 2.13 Suppose that we do define the *£ for appropriate members of IR using definition 
2.10 and using a set theory like Jech (1971). Thus *£ is defined for each ne las the IN is defined 
within this set theory. Now let's define a collection of sequences from IN into K as follows, for each 
n £ 3N, let 

. , _ J 0; if i £ M and i < n 
■ tn{l) ~\i-n- Hi>n 

Here are what some of these sequences look like. 

/o(0) = 0, / (1) = 1, / (2) = 2, / (3) = 3, / (4) = 4, . . . ; 
/i(0) = 0, /i(l) = 0, /i(2) = l,/i(3) = 2, /x(4) = 3, . . . ; 
/ 2 (0) = 0, / 2 (1) = 0, / 2 (2) = 0, / 2 (3) = 1, / 2 (4) = 2, / 2 (5) = 3, . . . . 

Thus, the sequences after the "0" values have "shifting" range values. From the definition of *£, it 
follows that • • • [/ 2 ] *e[/i] *e[/o]. To sec this, take, say [/ 2 ], [/i]. Then / 2 (0) = i /i(0) - 0, / 2 (1) - 
i h(l) = 0, / 2 (2) = e /i(2) = 1, / 2 (3) = 1 e /i(3), .... Hence, {n | / 2 (n) G = {n \ n > 

ljeCcU. 

Thus, when viewed from the external set theory, *€E is not well founded and does not behave 
in the same manner as does £ . Further, the £ is used to define the "hyper" objects. In order to 
avoid this problem for the most basic level, the set TR is considered a set of atoms (Jech, 1971) or 
urelements or individuals (Suppes, 1960). This means that each member of El is not considered 
as a set and a statement such as x £ y where x, y £ M, has no meaning for our set theory. 
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3. HYPER-SET ALGEBRA 
INFINITE AND INFINITESIMAL NUMBERS 

Usually, it's assumed that we are working with one specific free ultrafilter. Is this of any 
significance for our embedding? 

Theorem 3.1. Let infinite X C IN. Then there exists a free ultrafilter U such that X E U. Let 
[A], [B] e *m. Then [A] = [B] for all free ultrafilter s iff {n \ A n = B n } e C. 

Proof. Let infinite X C M. Suppose that A E C and inl = 0. Then Icl-i=a finite 
set. Since C has the finite intersection property, this contradiction implies that C U {X} has the 
finite intersection property. Hence, there is an ultrafilter U such that C U {X} C U. Obviously, if 
{n | A n = B n } E C, then [A] = [B] for all free ultrafilters. Suppose that [A] = [B] for U and that 
{n | A n = B n } $l C. Then X = {n \ A n ^ B n } is infinite. Hence there is some free ultrafilter U\ and 
X E IA\ . Thus for this ultrafilter [A] ^ [B] and the proof is complete. | 

Later, for Theorem 3.11, I'll use this result to show that nonstandard objects contained in the 
same defined set may be considerable different if different free ultrafilter are used. However, the 
actual results obtained when this material is applied to real analysis are, unless otherwise stated, 
free ultrafilter independent. 

The objects that appear in each structure are also objects that can be discussed by means of 
the set theory of which these objects are members. It's possible to extend these structures to include 
other objects from this set theory. Shortly, the ^-transform process is introduced and the structures 
will be slightly extended to use this process in a technically correct manner. 

Theorem 3.2. *-Algebra. 

(i) *0 = 0. 

(ii) IfXcTR [resp. IR™], then a X C *m [resp. *{TR n )}. 
(hi) LfX C m, then *x E a X iff x E X iff *x E *X 

(iv) Let X,YCTR. Then X C Y iff *X C *Y. 

(v) Let X, Y C m. Then *(X — Y) = *X — *Y. 

(vi) Let X,Y CTR Then *(X U Y) = *X U *Y . Also, *(XDY)= *X n *Y. 

(vii) Let X C 3R. Then X is a nonempty and finite iff *X = a X. 

(viii) Let X\, . . . ,X n C IR. For the customarily defined n-ary relations, $ = (X± x • • • x 
X n _ L ) x X n iff *$ = * (X 1 x • • • x X„_i) x *X n = ( *X 1 x • • • *X n _ l ) x *X n . Thus, * (m») = (*TR) n . 

(ix) The statements (Hi), (iv), (v), (vi) and (vii) hold for IR n , n > 1. 

(x) For i > 1 and ^ $ C IR™, let Pi denote the set-theoretic i'th projection map. Then 
*(Pi(*)) =Pi(**). 

Proof, (i) If 5" = 0, then for any a € 1R N , {n \ A n e S} = £ U. Thus our hyper-set algebra 
yields that *0 = 0. 

(ii) This is simply a repeat of Definition 2.12. 

(iii) By definition, *x E ° X iff x E X. Now assume that x E X. Then, by definition, *x = 
[X n ], X n = x for each net Hence, {n \ X n E X} = BJ ElA. Thus, *x E *X. Conversely, assume 
that *x E *X. By definition, *x = [X n ] and X n = x for all n EM. Thus ^ {n \ X n = x} = W E U. 
Hence, x E X. 
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(iv) Let X C Y C TR. Then Ic«anda£ *1 iff {n | A n £ X} £ W. But, {n | A„ e X} C 
{n | A n £ Y}. Thus, {n | A n £ Y} £ U. Now assume that *X C *Y. Then for each x £ X, *x £ *X 
by (iii). Thus *a; £ *Y. Again by (iii) x £ Y. Thus X C Y. 

(v) First, notice that X - Y C m. Let a e * {X - Y). Then {n | A n £ (X - Y)} £ U} = U £ U. 
But, this implies that U C {n \ A n £ X} £ U and U C {n \ A n <£ Y}. Thus, {n \ A n <£ Y} £ U. 
Hence, a ^ *Y. Consequently, a £ *X— *Y. I'm sure you can establish the converse that a £ *X— *Y 
implies that a £ * (X - Y). 

(vi) The sets XUY and X CiY are subsets of TR. Now simply notice that the following identity 
characterizes the intersection operator. C = X C\Y = X - (X - Y). Thus, *C = *(X f]Y) = 
*X-(*X-*Y) = *Xn*y.Thcnae * (XUY) iff {n \ A n £ (XUY)} = {n \ A n £ X}U{n \ A n £ Y}. 
Hence, if {n \ A n £ (X U Y)} £ U, then cither {n \ A n £ X} £ U or {n \ A n £ Y} £ U. 
Thus, *{X U Y) C *X U *Y. Since X £ (X U Y) and Y C (XUY), it follows from (iv) that 
*X(J *Y £ *(XUY) and the result follows. 

(vii) The first part is established by induction. Let X = {x}. Then {x} C ffi.. By definition 
a £ *X iff {n | A„ £ {x}} = {n \ A n = x} £ U. Now *x = [X] and {n \ X n = x} = W £ U. 
Thus, {n | X n = B n } = {n \ X n = x} n {n \ B n = x} £ U implies that [X] = [B] = *x. Assume 
the result holds for a set with k members. Then * {xi, . . . ,Xk+i} = * {{xi, ■ ■ ■ , Xk} U {xk+\}) = 
*{x±, . . . , Xk} U * {xk+i} = { *x\, . . . , *Xk+\\ by the induction hypothesis and (v), the result holds 
for any k > 1. 

For the converse, let infinite X £ TR and assume that a X = *X. There exists an injection 
B: M -» X. Hence {B n \ n £ M} is an infinite subset of X. Let *x = [X] £ a X. Then X n = x £ X 
for each n £ 3N. But, {n \ X n = B n } is finite. Hence, [X] [B] since {n \ X n ^ B n } £ C. Also 
{n | B n £ X} = IN £ U implies that b £ *X. There is no x £ X such that *x = b £ *X implies 
a X ^ *X. 

(viii) The customarily defined notion of an n-ary relation can be found in Jech (1971). The first 
idea is that a 1-ary relation is but the subset of 1R and this has been established in (iii). The other 
cases, n > 1, for Xi £ Xi C It, 1 < i < n the Cartesian product X n is characterized by the statement 
that (x\, . . . , x n ) £ (X\ x • • • x A„_i) x X n iff Xi £ Xi, 1 < i < n, where the actual "Cartesian 
product" is defined by induction. That is X\ x X 2 x X 3 = (X\ xI 2 )xX 3 etc. (There are other ways 
to define the Cartesian product more formally just using 2-tuples and finite sequences.) Note that 
for any k > 1, {n | (^i(n), . . .,A k (n)) £ (X 1 x • • • x X fe _i) x X k } = {n \ A^n) £ Xi} D ■ ■ ■ D {n \ 
Ak(n) £ Xk}- The result follows from basic filter properties. 

(ix) Statements (iii), (iv), (v) are proved in the exact same manner for (*IR) n . Statements (vi), 
(vii) are proved by application of the method used in (vii) coupled with the characterization used 
to establish (Viii). 

(x) Let a £ *(Pi($)). Then {n | A n £ (P,($))} £ U iff {n \ there exist B u . . .,B^ u B l+1 , 
B m such that (Bi(n), . . . , Sj_i(n), A(n), B i+ i(n), . . . , B m (n)) £ $} £ U. Hence, there exist bj,l < 
j < m, i 7^ j such that (6j, . . . , a, . . . , b m ) £ *<f>. But from the definition of such a 
projection this gives that a £ P^(*$)). Thus *(Pj($)) = Pj(*3>) because of the equivalence of 
the two set-theoretic statements. (At present, I have not introduced a more formal way of writing 
definitions for such sets.) | 

Important. Theorem 3.2 involves properties about our original structure (TR, +, ■, <, $,) and 
the (*TR, +,-,<, and the embedded objects. Although the embedded objects "behave" like 
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the original objects, they're still different from these. This observation will come into play when I 
discuss the notion of *-transform. 

It's about time that I demonstrated that the "ideal" numbers used by Leibniz, that did not 
really exist mathematically until 1961, exist within *B. These are the infinitesimals which solve this 
three hundred year old problem. 

Definition 3.3. (Infinite and infinitesimal numbers.) As usual define the absolute value 
function (i.e. binary relation) for members of a € *M by requiring as *\a\ = \a\ = b iff {n \ \A n \ = 
B n } S U. Although, I won't show it, but later will show how to establish the fact, this function * | • | 
has the same mathematical properties as does | • | for members of IR. So, I have written it as if its a 
restriction to our embedded a TR of the usual absolute value function. An a e *TR is infinitely large 
or simply an infinite number or shorter still infinite iff *x < \a\ for each *x <G CT IR. (Some might 
go back to the pre-embedded IR for these definitions, but remember we are thinking of a TR as our 
actual set of real numbers.) A b E *TR, is an infinitesimal or as Newton stated infinitely small iff 
< \b\ < *x for each x € TR + , the set of all positive real numbers. 

Now that these "new" types of numbers are defined, do any exist? 

Example 3.4. Let A be the member of 1R N with the property that Ak = k for each k e IN. 
Let *x <E a TR. Then there exists some to € 3N such that |a;| < to. Hence, |a;| = \X n \ < A m = to for 
each n e IN implies that {n \ A n > \X n \} D {to, m + 1, . . .} £ C C U. Thus, a is an infinite number. 
There are a lot more. 

Note that * is the trivial infinitesimal. Consider, the sequence G„ = 1/n, n G IN — {0} 
and Go = 0. Then j / *0. Now for each x e TR + there is some to e IN, to ^ such that 
< 1/to < x. Thus *0< T/*to< *x. (Note: So far we would need to establish such statements by 
ultrafiltcr properties. But, this is really trivial because by definition each *x is the constant sequence 
representation.) Now IN — {n \ G n > X n } is a finite subset of IN. Hence, {n \ < G n < X n } eCcW. 
Thus, g is an infinitesimal. Indeed, once we get one nonzero infinitesimal, we can generate infinitely 
many. 

Definition 3.5. A a E *TR is finite or limited iff it's not infinite. That is if there is some 
*x e *IR + (positive hyperreals) such that \a\ < *x. The set of all finite numbers is denote by 
G(0), the galaxy within our universe *TR in which CT IR resides. (Note: If a € *3R, then a e G(0) iff 
there is some *y such that \a\ < *y.) The set of all infinitesimals is denoted by /i(0). 

What is the algebra of the infinitesimals and does this algebra display the exact algebra used by 
Newton and Leibniz? It's customary to let lower Greek letters represent nonzero infinitesi- 
mals. Then as one would expect capital Greek letters represent infinite numbers. I'll show 
later that many real valued functions defined on open intervals about zero preserves infinitesimals. 
Indeed, if x > and /: (— x,x) — > IR is continuous at x — and /(0) = 0, then */(e) = A or 0. 
(Notation: "/ is a function that takes each and every member of (— ar, x) and yields members of IR.") 

We know that *m is a totally ordered field and /x(0),G(0) C *B. Also, e fj,(0) n G(0). The 
next Theorem, 3.8, gives exactly how the relations +, •, < behave when they are restricted to /z(0) 
and G(0). It will turn out that both of these sets are totally order rings with no zero divisors. 
What does this mean for the relations *+, *, *<? This means that /x(0) and G(0) behave for these 
binary relations exactly like the integers {• • • , —3, —2 — 1, 0, 1, 2, 3, • • •} behave with the one exception 
that *1 ^ A*(0). By the way, if your interested, the x,y are zero divisors iff xy = implies that 
x = or y = 0. 
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Certainly, establishing results by using the basic properties of the ultrafiltcrs is getting a bit 
tedious. There most be a better way. And, there is. If a statement about (It, +, •, <, 3>j) is expressed 
in a special way and that statement holds, then there is a process that's used to show that a 
altered statement holds in (*TR, +,-,<, The process is called ^-transform. However, to do 

this properly it's necessary to extend the structure considerably. 

(It's not really necessarily that you fully understand the contents of my new extended struc- 
ture. You could just go immediately to Definition 3.6 and simply restate Theorem 3.7 only in 
terms of the notation M. and *M. and without the structures being specified.) Because of the 
way I have defined the *-extension operator in Definition 2.10, the structure I technically need is 
(m, . . . , TR n , . . . , V(TR), . . . , V(TR n ), ...,+,-, <). Although not actually necessary I have identified the 
three indicated binary relations used previously. The actual n used in practice is rather small, 
usually. Let each element of each of the objects in {M, . . . , M", . . . , "P(]R), . . . , "P(IR"), . . .} have a 
"constant" name and we use +, •, < and the like as the "names" for these specific objects. Also the 
constant that "names" a mathematical object itself will not be differentiated from the mathematical 
object itself. Let Cn be this set of all of these constants. Notice that "constants" that are mem- 
bers of a n-ary relation n > 1 like (x,y,z), use the constants x,y,z from B. These n-tuple forms 
(x\, . . . , x n ) are part of our language. 

Definition 3.6. (*-transform) Consider any properly formed statement (formally a first-order 
formula with equality and constants using the atomic formula in the appendix) with bounded quan- 
tifiers and only using members of Cn. Then the ^-transform of this statement is obtained by writing 
a * to the left as a superscript of each constant. Also, there is the reverse process where a statement 
in terms of the Cn is obtained by removing the *. 

I'm not going to present a course in first-order logic in this monograph. So, you'll simply need 
to assume that I've expressed the "formal" statements in the proper bounded form. This means 
that the "variable" that appears to the right of a quantifier, the universal "for each," Vx, and the 
existential "there exists some," 3x, must vary over one of the sets in the standard structure. Of 
course, it turns out that mathematicians seem to always write their informal sentences in forms 
that are logically equivalent to these bounded forms. The reason for the bounded form is that in 
the appendix Theorem A3 establishes the following without using the Axiom of Choice. Only some 
previous results obtained using ultrafiltcrs are needed. Notice in what follows two new symbols are 
introduced for the respective structures and the structures are now extended slightly. 

Theorem 3.7. Let S be any sentence in bounded form that uses only constants in Cn. Then 
S holds for M = (1R, . . . , 1", . . . , "P(IR), . . . , V(TR n ), ...,+,-,<) iff the *-transform of S holds in 

*M = (*m, . . . , *m", . . . ,V{*tr), . . . ,-p(*m"), ...,+,-, <). 

WARNING If someone who has experience with nonstandard analysis reads Theorem 3.7, they 
might state that the theorem is in error, since I have written V(*TR n ) 7 etc., in the structure. However, 
it is correct as shown in the appendix for the language being used. For example, an expression such 
as 3x(x G V(*lRj) is not in the proper *-transfer form. This theorem only applies to 3x(x e *P(*M)). 

I don't suppose that you noticed that *-transform is a reversible process that relates our original 
structure M and the nonstandard structure *M and does not technically mention the embedded 
objects. Here's one place where there is a different notational approach. In some work, the original 
structure and the embedded structure are consider as identical. I will probably not do this if there's 
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any possible confusion. Also there are actually certain properties of M that can't be expressed in our 
formal language. The language could be enriched. But, if this is done one might as well go all the 
way to the object called a superstructure. The idea is to see what can be accomplished without 
such an enriched language and the additional complications this would produce. 

Theorem 3.8. The sets /z(0),G(0) are totally ordered subrings of*TR with no zero divisors and 
G(0) has an identity. 

Proof. I start with G(0) since it contains ri(0) and G(0) C *1 to show that it is a totally 
ordered ring with no zero divisors, all that is needed is to show that it is closed under the operations 
+, -.To do this efficiently Theorem 3.4 is used. Informally, we know that if we are given any two 
real numbers x, y, then \x + y\ < \x\ + \y\. The formal bounded statement of this fact is 

VxVy((x G m) A (y G m) -» |x + y| < |x| + |y|) 

holds in A4, and, hence, its *-transform holds in *M. Thus, 

VxVy((x e *H) A (y e *IR) -> |x + y| < |x| + |y|) 

is a fact about *A4. (Note: You could have written | • | as *| • |. I also point out that this is actually 
considered as written in the form for a particular where we have in our language the ordered 
n-tuple notation. Define = {(w, y, z) \ \w\ < \y\ + \z\}. Then |x + y| < |x| + |y| is equivalent to 
((w,x,y)e $j) A(w = x + y).) 

Thus, the triangle inequality holds in *TR. So, let a, b G G(0). Then there are standard *x, *y G 
CT ]R such that \a\ < *x, and |6| < *y. But \a + b\ < \a\ + \b\ < *x+*y= * (x + y) from our definitions 
and the order properties of of *IR. This gives that a + b G G(0) which gives us closure under + 
since *0 G G(0). In like manner, one gets that ab € G(0). Of course, since G(0) C *TR the members 
have the usual associative, commutative and distributive properties and *0 is its zero. Now either 
by ""-transform or filter properties * 1 is also an identity in G(0). It now follows immediately that 
since *TR is a totally ordered field, then G(0) is a totally ordered ring. Further, since *]R has no zero 
divisors neither does G(0). 

Next, consider /i(0). We can apply the method used to establish that G(0) is a totally ordered 
ring with no zero divisors (but 1 <^ /«(0)) to show that ^(0) is a totally ordered ring with no zero 
divisors. The only difference in the proofs is that instead of writing that there is some *x > such 
that \a\ < *x, we have that e € /z(0) iff |e| < *x for all arbitrary *x > 0. | 

Does /x(0) have any other significant algebraic properties? The answer is yes and it's this 
most remarkable property that's needed if its members are to mimic the "infinitely small" notion of 
Newton. What /j,(0) does is to "absorb" via multiplication every member of G(0). It has this "ideal" 
property. Aid G(0), is an ideal iff it is a subring (which ri(0) is) and for each a G G(0) and each 
b G I, the product ab G /. An ideal / C G(0) is maximum iff for any other ideal I\ D I, I = I\ or 
/ = G(0). 

Theorem 3.9. The set of infinitesimals fj,(0) is a proper maximum ideal in G(0). 

Proof. Let a G G(0) and e G ^(0)- Then there is some *x G a TR such that \a\ < *x. Let 
e = g = [G]. Consider arbitrary positive *y. Then 

F = {n | \A n \ < X n = x} n {n \ |G„| < Y n = y} G U. 
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However, {n | |A„G„| < xy} D F. But xy is also arbitrary. Hence, ae G ^(0)- 

Let / be any ideal in G(0) such that /i(0) C /. Assume that there is some b G I— /i(0). Then b ^ 
and there exists some positive x G H such that {n | |_B n | >r}eii. Hence, {n \ \1/B n \ < 1/x} G W. 
Consequently, [-B -1 ] = b^ 1 G G(0) implies that *1 = G /. This last fact will always force 

/ = G(0) since it's an ideal. Well, take any ^ i e 1. Then *x ^ *0 and *x ^ fi(0) implies that 
ri(0) ^ G(0). Hence, /x(0) is a proper maximum ideal. | 

I could go into some other abstract algebra material and use the language of quotient rings, 
isomorphisms and kernels to show exactly how G(0) and ^.(0) are related to CT IR, but it's unnecessary 
to do this for this simplified approach. It's enough to say that the properties of /z(0) exactly match 
the "infinitely small" of Newton and the "ideal numbers" of Leibniz. 

It has become customary to drop the * from the members of a TR when there is no confusion. 
I'll start doing this in the very important next definition. 

Definition 3.10 (Monads of standard numbers.) Let x G ""SR. Then the monad of (about) x 

is the set ri(x) = {x + e \ e G /u(0). The only standard object in /j,(x) is x. (Recall that when there's 
no confusion, I might use x in place of *x.) 

Before showing a remarkable relation between the monads and G(0), I need the next theorem. 

Theorem 3.11. Let A n be a sequence of real numbers. Then [A] G ri(x) for every free ultrafilter 
iiflimn—oo A n = x. 

Proof. First, note that, for a fixed free ultrafilter U and its monad n(x), [A] G ji{x) iff there is 
some e G /u(0) such that [A] = x + e, [A] - x = e iff [A] - x g /ti(O) iff {n \ \ A n - X n \ < r} g U for any 
arbitrary positive r. Let U be any free ultrafilter and assume that A n — > x. Then for arbitrary positive 
r, we have that \A n — x\ < r for all but a finite number of A n . Thus, {n \ \A n — X n \ < r} g C C U. 
But r is arbitrary implies that [A] g /i(x). 

Conversely, assume that A n -f^ x. Then there is a positive r such that X — {n \ \A n — x\ > r} 
is an infinite set. Any infinite subset of IN is contained in some free ultrafilter U\ by Theorem 3.1. 
Thus, for this Ui, [A] ^ Hi(x) since the complement of X is not a member of U\. 

Theorem 3.12 The collection {ri(x) \ x g CT IR} is a partition for G(0). 

Proof. Technically, to be a partition of G(0), one must have that /j,(x) n n{y) ^ implies that 
ri(x) = /j,(y) and that \J{^{x) \ x g CT IR} = G(0). For the first part, assume that there exists some 
a G /i(a;) H /u(?/)- Then a = e + .x, a = A + y. But, e + x = A + y implies that e — A = y — x. This 
is only possible if e — A = since y — x g CT IR. Thus x = y. Let a € U{m( x ) I x *= Then 
a = e + x for some x G CT ]R. Then \a\ = \e + x\ < |e| + |x| < \x\ + 1. Hence a e G(0). Consequently, 
U{mW I z G ff 3R} C G(0). 

Now assume that a g G(0). Rather than continue to use the properties of the our free ultrafilter, 
let's just consider the properties of <. Hence, there is some *x g CT IR + such that a < *x. So, consider 
the set S = {y \ *y < a} This set is nonempty since — x g 5. Also since a < *x, 5 is set of 
real numbers that's bounded above and as such has a least upper bound z. The number z needs 
to be located. Assume that \z — a\ is not an infinitesimal. Thus there is some w G 11 such that 
| * z — a\ > *w. Suppose that * z < a. Then a— * z > *w implies that * z+ *w = * (z + w) < a implies 
zlioeS and z is not the least upper bound. So, let a < *z. This implies that a < * (z — w) < * z. 
But, z — w is an upper bound for the set S. This contradicts the least upper bound property for z. 
Hence, * z — a = e implies that a G [t(z). | 
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Of course, this implies, in general, that {J{fi(x) | x G a 3R} = G(0) is free ultrafilter independent. 
But, the monads that contain some of the members of ~SR N cannot be readily determined. 

Example 3.13 Consider the sequence a = {1,-1,1,-1,...}. Then as done in the proof of 
Theorem 3.11, U\ = {n \ A n = 1} n U2 = {n \ A n = —1} = 0. The set U\ is a member of the free 
ultrafilter U\ and [7 2 is a member of the free ultrafilter U2 where U\ ^=U2- Further, a G Mi(l) and 
a G M2(-l)- 

There are some other useful properties that relate members of *IR, and the sets G(0) and fi(Q) 
and show that they model the older notions of real number "infinities" and the "infinitely small." 
Let *IR— G(0) = IRoo be the infinite numbers. It's immediate from the definition that if a, 6 G IRoo> 
then ab G ffioo- If < a [resp a < 0] G IRoo and a < 6 [resp. b < a], then b G Hoc since a > r [rcsp. 
a < r] for each r € 1. 

Theorem 3.14 

(i) If be TRoo, then 1/6 G /x(0). 

(ii) IfO^ee /z(0), tften 1/e e 1 M . 

(iii) Le£ e G ^(0), 6 G T^/ien e + b G /Li(a;) a^rf e 6 G /x(0). 

(iv) If b £ IRoo, afid *a; 7^ *0, t/ie« &*x G Hoc. If a £ G(0) - /i(0), i/ien 6a G IRoo- (TTie 
*IR oc almost has the special property associated with an ideal.) 

(v) If *x < *y, then *x + e < *y + A /or any e, A G /i(0). 

Proof. (Most mathematicians would consider these proofs as trivial and would "leave them to 
the reader." But, I'll do most of them.) 

(i) If 6 G IRqo, then for any *x G CT IR+, *x < \b\. Thus by field properties, 1/|6| < *x. This says 
that 1/|6| G /x(0). 

(ii) Same method as (i). 

(iii) Let e G fi(0) and 6 G fJ.(x). Then 6 = *x + A implies that e + 6= *a; + e + A= *x + 7G /u(0). 
Then e 6 G /z(0) from Theorem 3.9 or e b = e *x + £7 = a + f3 G n(0). 

(iv) Using (i), 7^ l/(*x6) G At(0). Now use (ii). For the second part, use the fact that if 
6 G G(0) — /i(0), then if a > *0, there is some *x > such that *x < a and if a < 0, then there is 
some *y such that a < *y. Now apply the remark I made just prior to this theorem. 

(v) Assume that < e - A G /u(0). Hence, for a < 6, < e - A < 6 - a. Thus a + e<6 + A. | 

The fact that the *x) \ *x G a TR} forms a partition of G(0) immediately defines for all 
members of G(0) an equivalence relation of some importance, where this relation is a short hand for 
a member of G(0) being in a unique fi(x). 

Definition 3.15. (Infinitely close (near) equivalence relation.) Two a, b G G(0) are 
infinitely close iff a — b G /u(0). This relation is written as a w 6. 

We almost have enough of the basic machinery to continue with real analysis. But, there is one 
last major procedure that needs to be introduced, the "standard part" operator. 

Definition 3.16. (The standard part operator, st.) Using Theorem 3.12, there is a 
function st on G(0) into a TR such that, for each n{x), st(ri(x)) = *x <-> x. Once the properties of 
st are obtained, then, usually, one further allows st(//(a;)) = x G B. The function st is called the 
standard part operator. 
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Most of the results in the next theorem would be what one would expect. 

Theorem 3.16. Let st:G(0) — > CT ]R (K) be the standard part operator. Then for each a, b G 



(i) st(a ± b) = st(a) ± st(6). 

(ii) st(ab) = st(a)st(6). 

(iii) If a <b, then st(a) < st(6). 

(iv) st(|a|) = | st (a) |, st(max{a, b}) = max{st(a), st(b)}, st(min{a, b}) — 
min{st(a), st(6)}. 

(v) st(a) =0iffae /z(0). 

(vi) For any *x, st(*;r) = *x. 

(vii) The st(a) > iff \a\ G ^(st(a)). 

(viii) a~b iff a — b G ^(0) iff st(a) = st(6) 

(ix) // st (a) < st (b), then either a — b G ^(0) or a < b. 

(x) If *0 < c [resp. c < *0] and c G *TR OD , then for a > *0, *0 < c + a G *]R 00 [resp. 
a < *0, c + a G *m oo ]. 



Proof. I'll do (iii) and leave the others to the reader. Let a, 6 G G(0), a < b. Then a G 
/i(st(a)), 6 € /i(st(6)) implies that a = st(a)+e, b = st(6)+7 implies that < st(6) — st(a)+7 — e = 
st(6 — a) + 7 — e implies that st(a) < st(b) since the monads are disjoint! 

Is it clear that if INoo = *M - a M, then INoo C IRoo? 

Theorem 3.17. The set of infinite natural numbers Woo C IRoo and, for each n G M *n < A 
for each A G . 

Proof. In example 3.4, the infinite number defined is actually a member of ~SR K C *IR. Thus, 
*M- CT IN 7^ 0. For each m G CT IN, there is the ra+1 G CT IN C a TR. Hence, CT M C G(0). Let ae*!-"! 
and a G G(0). Then since *0 < b for each 6 G *M, there is some r G CT 3R such that *0 < \a\ = a < r. 
But, we know there is some m G 3N, hence, *m G ^BJ such that *0 < a < *m. *-transform of the 
statement "for each x, for each y, for each z, if x G IN and y G IN and z G IN and x < y, then z G [x, y] 
iff < z < y" or formally Va;VyVz( (a; G M)A(y G W)A(z G W)A(a; < y) — > (z G [x,y] ^ (0 < z < y))) 
holds in *A4 and characterizes the set *[*0, *m]. Thus, a G *[*0, *m]. But, since [0,m] is a finite 
set, then Theorem 3.2 (vi) implies that a = *n for some *n G CT IN. This contradiction implies that 
INoo C Boo as one would expect. The second part follows immediately. | 



G(0), 
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4. BASIC SEQUENTIAL CONVERGENCE 

One intuitive statement about sequences of real numbers states something like "all the con- 
vergence properties are determined by the behavior of the infinite tails." In fact, for elementary 
converges, we have "Well, the values of the sequence get nearer, and nearer, and nearer and stay 
near to the limit no matter how far you go out in the series." Does the nonstandard theory of 
sequential convergence model both "getting nearer, and nearer" and "staying near" simultaneously? 
Indeed, you'll find out that, for convergence, the infinite tails are all members of G(0). More- 
over, each nonstandard characteristic based directly upon a definition is stated in, at least, one less 
quantifier. Godel considered that just removing one quantifier from any characterization is a major 
achievement within mathematics. By the way, all of the results presented in the remainder of this 
book are free ultrafiltcr independent. Also, many of the definitions and proofs presented are easily 
generalized to the multi- variable calculus. (I'll use the notation n e *IN where there is no confusion 
as to the location of the n. Usually one might write this as a € *IN.) 

Theorem 4.1. A sequence S: IN — > IR is bounded iff *S(n) e G(0) for each n £ *M iff 
(*S[*M] C G(0).) 

Proof. Let S be bounded. Recall what this means. There exists some x e TR + such that 
"for each n g IN, \S(n)\ < x" or Vy((y g IN) — > (IS'(y)l < a;)) holds in M. By *-transform, 
W((y € * m ) —> ( 1 *^(y) 1 < * x )) holds in *M. (Note: We can consider with respect to our embedding 
that | • | is but a restriction of * | • * | and we need not use the * there, although this is but a notational 
simplification.) Hence, for each n € *IN, *S(n) e G(0). 

Conversely, for each n g *M, let *S(n) G G(0). We know that there is a b € *IR+ C *IR + such 
that for each c g G(0), \c\ < b. Hence, 3x((x g *1R + ) A Vy((y g *3N) -> | *S(y)\ < x)) holds in *M. 
Thus, the statement 3x((x e IR + ) A Vy((y e IN) — > \S(y)\ < x)), obtained by dropping the *, holds 
in M and the sequence is bounded. | 

What about the "near to L" and "stays near" intuitive notion and it's relation to the "true" 
infinite part of the tail? 

Theorem 4.2 A sequence S: IN — > IR converges to L e IR (S n — > L) ^ *5(A) — L e ^(0) /or eac/i 
A € INoo jff *5(A) e /or eac/i A e «ffst(*5(A)) = L /or eac/i A e i# ( *S'[]N 00 ] C fi(L).) 

Proof. Let S: IN — > IR converge to L. Let y G IR + . Then we know that there exists some m £ IN 
such that for each k € IN where k> m, \S(k) — L\ < x. Hence, the statement 

Vx((x G IN) A (x > to) — > (|5(x) -L\< y)) 

holds in M; and, hence, in *A4. In particular, by ^-transform, for each A <G INoo, | *S(A) — L\ < *y. 
Since, y is arbitrary, then *S(A) - L g ^(0) for each A g INoo. Hence, *5"(A) g fj,(L) and st( *S(A)) 
I for each A g INoo- 

Conversely, assume that ( *S(K) — L) g /u(0) for each A g INoo- Let y g IR. Since INoo 7^ 0, then 
by Theorem 3.17, the sentence 

3z((z g *T&) A Vx((x g *U) A (z < x) -> (| *5(x) - L| < *y))) 

holds in *A4. Thus, it holds in A4, by reverse *-transform. But, this is the standard statement that 
S n — > All the remaining "iff" are but restatements of *5(A) — L g yu(0) for each A g INoo. | 
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Corollary 4.3 All the basic limit theorems for sums, products, etc. all follow from Theorem 
4-2 and the properties of the "st" operator. 

Examples 4.4 

(i) (l/n) p — » 0, n,p > 0, p £ M. We know that for each nonzero A £ Moo, (1/A) P £ fj,(0) 
and the result follows. (If we had result that the continuous function f{x) = x p , p > 0, preserves 
infinitesimals, then we could extend this to any p > 0. But, maybe it's better to use sequences to 
motivate continuity.) 

(ii) x n — > 0, < \x\ < 1. In general, for any n,m £ I such that n < m, we have that 
(l/|a;|) n < (l/|ir|) m . For any y £ TR, there is some n £ IN such that \y\ < (l/|a;|) n . Hence, for each 
A e DJoo (l/|a;|) A e 3Roo- Consequently, x A £ /z(0) for each A £ W^. 

(iii) Let < x, x^ 1. Then x x l n -> 1, n > 0. Consider that case that x > 1 and S n — x * n — 1 . 
Then x = (S n + 1)™. Hence, x > nS n for each n > 0. Thus, by ^-transform, x > A*Sa for each 
A £ Moo. Consequently, < *S(A) < (x/A) £ fj,(0) for each A £ W^,. Thus *S(A) £ fj,(0), A £ "W^ 
and result follows in this case. 

Now, if < x < 1, then 1 < 1/x and, as just shown, (1/.t) 1/a £ fi(l), A £ TN^. Thus 
(l/x)^ A - 1 = e £ /i(O). Hence, 1 - x^ A = e(x^ A ) £ fj,(0), since by ^-transform < x 1 ^ < 1. 
Therefore, x 1//A £ for this case also and the complete result follows. 

(iv) n 1 /™ — > 1, n > 0. Consider again the sequence S„ = n 1 /™ — 1. Then n = (1 + S^)™ = 

ELi (fc) > Sl n > 1. Thus, < 5„ < (^r) 1/2 , n > 1. By *-transform, and in 

particular, < *S A < ( x^j-) 1/2 , A £ W^. But, {-^) 1/2 £ £t(0). Hence *5(A) e fj,(0) and the result 
follows from the definition of S n . 

It seems that some of the above algebraic manipulations are what one might do if these limits 
were established without using nonstandard procedures. There are major differences, however, in the 
number of quantified statements one needs for the standard proofs as compared to the nonstandard. 
Let's establish a standard result by nonstandard means. 

Theorem 4.5. Every convergent sequence of real numbers is bounded. 

Proof. Let S n -> L £ H. Then *S{A) £ fi(L) C G(0) for each A £ B^. Since *S[ a W] C G(0), 
the result follows from Theorem 4.1. | 

Theorem 4.6. v4 sef of real numbers B is bounded iff * B C G(0). 

Proof. If £? is finite, then it's immediate that *B C G(0). If B is infinite, then there is some real 
number x such that for each y £ B, \y\ < x. By *-transform of the obvious expression any a £ * B 
has the property that \a\ < *x. Consequently, * B <Z G(0). 

Conversely, if B is not bounded, then for n £ IN there is some x £ B such that \x\ > n. Hence, 
by ^-transform, there is some p £ * B such that \p\ > A, A £ Moo. From the remark made prior to 
Theorem 3.14, p ^ G(0) and the converse follows. | 

One of the first big results one encounters in sequential convergence theory is a sufficient condi- 
tion for convergence. Recall that a sequence is monotone iff it is either an increasing or decreasing 
function. The following characterization is what would be expected, that for monotone sequences 
only one infinite number is needed for convergence. 
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Theorem 4.7. If S: M — > B is monotone and there exists some A e BIoo swc/i that *S(A) G 
G(0), tfien 5„ -» st(*S(A)). 

Proof. Simply assume that S: M — * M is increasing since the decreasing case is similar. I first 
note that *y = st(*5(A)) e a TR. By *-transform, the extension *S:*M — » *IR is increasing. Thus 
for A G INoc and for each *m G ff ffiT, *5( *m) < *5(A) and, since *S'(A) G G(0), the st( *S{*m)) = 
*(S(m)) G ]R. Consequently, for M, then following sentence 

Vx((x G M) (S(x) < y)) 

holds in .M; and, hence, holds in *M. So, let G BJoo- Then *5(f2) < *y = st( *S(A)); which implies 
that for each Q G I M , *5(fi) G G(0). Thus st(*S(Q)) G CT ]R for each such and st(*5(fi)) < *y. 
Let Q > A. Then *5(A) < *S(fi); which implies that *y < st(*S , (il)). But, since the above 
statement still holds for such a f2, then *5(fi) < *y implies that st(*5(0)) < *y. Let Q, < A. Then 
*5(fi) < *S(A); implies st(*5(0)) = z and the above statement holds for z.. Thus, st(*5(A)) < 
st(*5(0)). Hence, st(*5(fi)) = st(*5(A)) for each G I M . Consequently, *S*(f7) - *S(A) G /i(0) 
for all G Bloc implies that *S(fl) G M( st ( *^(^-))) f° r eac h ^ G ^oo and the result follows. | 

Corollary 4.8. A bounded monotone sequence converges. 

Proof. By Theorem 4.1. 

Please note that if a — 6 G /u(0), and b G G(0), then the intuitive statement that a G /i(st(6)) 
does, indeed, hold. In a slightly more general mode recall that for a sequence S: IN — > IR. a real 
number w is an accumulation point or limit point for S iff for each r G IR + and for each n G M, 
there is some m £ I that m > n and |/S m — iy| < r. This definition allows 1 to be an accumulation 
point of sequences such as {1, 1/2, 1, 1/3, 1, 1/4, 1, . . .} where both 1 and are accumulation points. 
This definition does not correspond to most of the accumulation point definitions for point-sets. 
However, there will be a another term used in chapter 8, that does so correspond. 

Theorem 4.9. 

(i) A w G TR is an accumulation point for S:M — > TR iff there exists some A G Moo such 
that *S(A) G fi( *w) = /i(st(*5(A)). 

(ii) A sequence S: M — > K has an accumulation point iff there exists some A G Moo such 
that *S(A) G G(0). 

Proof (i) Let to G 3R be an accumulation point for S. Then the sentence 

VxVy((x G m+) A(yel)^ 3z((z G M) A (z > y) A (|5(z) - io| < x))) 

holds in *jM by *-transform. So, let < e G /i(0) and G 3Noo- Then there exists some A G *M such 
that A > tt and | *5(A) - *w\ < e. Hence, *5(A) G n(*w). Clearly, A G M x . 

Conversely, assume that there exists some A G INoo such that *5(A) G fi(*w), w £ M. Note 
that n( * w) C *(w - y,w + y) for each y G IR + and that A > *n for all *n e a M. Hence, for given 
*w, *y > *0 and a given *m, we have that 

3x((x G M) A (x > *m) A (| *5(x) - *w\ < *y)) 

holds in *A4; and, hence, in by reverse *-transform. The result follows, 
(ii) This follows "immediately" from (i). | 
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Theorem 4.10. (i) A sequence S: IN ~ * TR has a subsequence that converges to w & TR iff there 
exists some A G INqo such that *S(A) G 

(ii) A sequence has a convergent subsequence iff there is some A G such that *S(A) G 
G(0) *#(*S[BJoo]nG(O)^0). 

Proof, (i) Assume that for A G EJoo that *S(A) G fi(*w). Then w is an accumulation point. 
You start with n = and take y = 1. Then you have an S m such that \S m — w\ < 1. Let S' = S m . 
Now take y = 1/2 and consider the next as the one for k > m and l^fe — u;| < 1/2. This idea 
can be restated in an induction proof for the other 1/n with no great difficulty. This subsequence 
obviously converges to w. (Have I used the Axiom of Choice to obtain the Sfc?) 

On the other hand, if S': IN — > TR is a subsequence of the sequence S and it converges to L, then 
*S"[IN 00 ] n G(0) 7^ implies, since *5"[IN 00 ] C *5'[IN 00 ], that L is an accumulation point by Theorem 
4.9. (ii) is obvious. | 

Theorem 4.11. A bounded sequence has a convergent subsequence. 
Proof. From Theorems 4.1 and 4.10. | 

Have I convinced you that the notion of what happens with the truly infinite tail piece of a 
sequence does determine all that seems necessary for basic convergence? No. Well, let's look at 
another idea, the special types of divergence written as S n — > +oo [rcsp. — oo]. 

Recall that a sequence S n — > +oo [rcsp. — oo] iff for each y > [resp. y < 0] there exists an 
to G IN such that for each n G IN such that n > to, S n > y [resp. S n < y]. How do we intuitively 
state such stuff as this? One might say that S converges to "plus infinity" or converges to 
"negative infinity." But, in basic real analysis, the "numbers" ±oo do not actually exist. 

Theorem 4.12. For sequence S: IN — > IR, S n — > +oo [resp. — oo ] iff for each A G IN^ 
*S(A) G IR+ [resp. IR~ ], where Et+ = {A | *0 < A G IRoc} [resp. IR~ = {A | *0 > A G IRoo}], iff 
(^[mj C m+ [resp. B"]). 

Proof. Assume that S n — > +oo. We can assume that 5„ > for each n G IN since it is not 
true for only finitely many n. Suppose that there exists some A G INoo such that *S(A) ^ IR+ . 
Thus, *5(A) G G(0). Therefore there is a subsequence of S, S': IN — > IR and *S'' l — > L, Lei and 
L = st(*5'(A)). Thus, there exists an to G IN such that for all n > to, \S'(n) — L\ < 1. Hence, for 
each such n, < S"(n) < L + 1. Thus, considering y = L + 1 there does not exist a p G IN such that 
for each n G IN, where n > p, S n > y. 

Conversely, suppose that for each A G Moo, *5(A) G IR+ . Let y > 0. Consider G Uqq. If 
A > 0, then A G INqo and under the hypothesis, *S(A) > *y. Consequently, the sentence 

3x((x G *IN) A Vz((z G *IN) A (z > i) -» ( *S(z) > *y)) 

holds in *A4 and, hence, in A4. This result for the positive infinite numbers follows by reverse 
*-transform. The case for the negative infinite numbers follows in like manner and the proof is 
complete. | 

By the way, notice how easily the next result is established. 

Theorem 4.13. If S : M — ► TR converges to L G IR, then L is unique. 
Proof. If L ^ M G IR, then ri(L) n n{M) = 0. | 
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Let's recap some the significant nonstandard characterizations for a sequence S: TO — > B. Notice 
that they are all quantifier (V, 3) free. 

Theorem 4.14. Given a sequence S: TO — ► IR. Then 

(i) S is bounded iff *S[*W] C G(0), 

(ii) S n L iff *S[m 00 ] C a*(L) C G(0), 

(iii) S 1 /wis a convergent subsequence iff *S'[M 00 ] n G(0) 7^ 0. 

(iv) s n ^±™ iff *s[w OQ ] c m±. 

I wonder whether S: TO — > B has a subsequence that converges to ±00 iff *S'[TO oc ] n ffi.^ ^ 0? 
So far, to show that a specific sequence converges we needed to guess at what the limit might be. 
One of the more important notions was considered by Cauchy, the Cauchy Criterion, that for the 
real numbers characterizes convergence without having to guess at a limit L. A sequence S is called 
a Cauchy sequence iff for each y £ TR + , there is some m £ M such that for each pair p, q £ TO such 
that p,q>m, it follows that \S(p) — S(q)\ < y. 

Theorem 4.15. (Nonstandard Cauchy Criterion.) A sequence S: TO — > K is Cauchy iff 

*S(A) - *S(Q) £ n(0) 

for each A, f2 £ W^. 

Proof. For the necessity, simply let real y > 0, then there exists some m y £ TO such that the 
sentence 

VxVz((x Gl)A(zeI)A(z> m y ) -> (|S(x) - S(z)| < y)) 

holds in M. and, hence, in *AA. In particular, if A, 17 £ M^, then A, SI > *m y for any such m y 
implies that | *S(A) - *S(fl)\ < *y for any y > 0. Consequently, *5(A) - *S(fi) £ /i(0). 

The sufficiency follows in the usual manner since /x(0) C * ( — y, y) for each y > and TOoo ^ 
imply that the sentence 

3w((w £ TO) A VzVx((z £ TO) A (x £ TO) A (x > w) A (y > w) -> (|5(x) - 5(z)| < y)) 

holds in At and the proof is complete. | 

Theorem 4.16. A sequence S: TO — > M converges iff it is Cauchy. 

Proof. Suppose that S„ — > L e 3R. Then for each pair A, £1 e M^, *S'(A) - L £ fi(Q), and 
- Is /i(0). Hence, "S(A) - *S(fi) e /x(0). 

For the converse, let 5: TO — » m be Cauchy. Then for A, He TO^, we have that *S(A) - *S(fi) £ 
/i(0) from Theorem 4.15. Let A e M x and *S(A) € G(0). Then *S[M 00 ] C /x(st( *5(A)) = ri( *L) 
implies that S n — > L. So, assume the other possibility, that *<S'(Q) ^ G(0) for any £1 e Woo. This 
implies that S is unbounded. Let m £ TO and let y = max{|S m ± 1|, |So|, • • • , l-Sml}- Then there is 
some p £ TO such that \S m ± 1| < y < \S P \ and p > m. Thus by ^-transform, given any A € TO^, 
there is some SleI M such that | *S*(A) ± *1| < S(Q). Notice that (i) *5(A) ± *1 £ TO+ , in which 
case, since *S(A)- *S(fi) € /i(0), it follows that *S(fi) G TO+ or (ii) *5(A)± *1 £ TO" , in which case 
*5(fi) e TO" . For case (i), consider *S*(A) + 1 < *S(ft); for case (ii), consider *S*(0) < *S(A) - 1. 
For these two cases, this yields that 1 < | *S , (fi) — *S'(A)| /u(0). This contradicts the hypothesis 
that *S(A) - *S(Q) £ /i(0). The proof is now complete. | 
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5. ADVANCED SEQUENTIAL CONVERGENCE 

Recall that a double sequence S: M x IN — > IR converges to L £ IR iff for each y £ TR + , there 
is some pel such that for each pair n,m £ IN, such that n,m > p and \S(n, m) — L\ < y. The 
same nonstandard characteristics hold for such convergence as in the single sequence case. 

Theorem 5.1. A sequence S: IN x IN — > IR converges to L £ TR iff *S(A, ft) — L £ /u(0) for each 
A, ft £ INoo iff *S(A,ft) £ n{L) for each A, ft e INoo iff st( *<S'(A, fi)) = L /or each A, ft e I M z/f 
(*5'[IN 00 x INoo] C /*(£,).) 

Proof. With but almost trivial alterations, this proof is the same as the one for Theorem 4.2. | 

Example 5.2 Let S{m,n) = j^j. Then for each A, ft £ M^, = L + ft 2 £ G(o). 

Hence, e /u(0) and, thus, S'(n, m) — > 0. 

The following results, and many more, for double sequences follow in the same manner as in 
Chapter 4. 

Theorem 5.3. Every convergent double sequence is bounded. 

Theorem 5.4. (Nonstandard Cauchy Criterion.) The sequence S: IN x IN — > IR converges to 
L £ TR iff for each A, ft, A', ft' £ M^, *S(A, ft) - *S(A',ft') £ /z(0). 

In the theory of double sequences, one of the interesting questions, at the least to most mathe- 
maticians, is the role played by the iterated sequences, (in brief limit notation) lim„(lim m s(n, m)) 
and lim„(lim m s(n, m)). What this notation means is that, taking the first iterated limit, you might 
have that for each n, lim m S(n,m) = S'(n) £ TR. Then, maybe, lim„ S'(n) £ TR. Now for a conver- 
gent double sequence, is it always the case that the iterated sequence converges? In the example 
5.2, notice that for n = 0, S(n,m) diverges. Indeed, take any natural number a. Then the sequence 
S(n, m) — 1+m (^„ a ^ will have this same problem for n = a. 

Example 5.5. Consider the sequence S(n,m) = ^"^^ ■ Then for any n £ IN, S(n,m) — ► 1, 
while for a fixed m, S(n,m) — » 0. This shows that the double sequence does not converge since the 
n,m £ M are arbitrary pairs and as such it should not matter if one is held fixed and the other 
varies, the limit being unique, as in this single sequence case, must be the same in all cases. As is 
well know, this behavior for double sequences is simply a reflection of the same problems that occur 
with multi- variable real valued functions. 

The problem displayed by examples like 5.2, does not occur for members of INoo as indicated 
by the following rather interesting pure nonstandard result. 

Theorem 5.6. Let S(m,n) converge to L. Then for any sequence ft m £ Moo, 
lim m st( *S( *m, ft m )) = L [resp. lim„ st( *<S(ft„, *n)) = L. 

Proof. Let S(m,n) — > L. We know that for y £ TR + there is some p £ IN such that for each 
pair m, n £ M and m > p and n > p, \S(m, n) — L\ < y. Now p may be assumed fixed for the y. 
By *-transform, it follows that | *S( *m, b) - L\ < *y for each *m > *p, and 6 £ *IN, 6 > *p. Hence, 
in particular for any sequence ft m £ INoo, | *S( *m, ft m ) — L\ < *y. Now taking the standard part 
operator on each side of this inequality implies that |st( *S( *m, ft m )) — L\ < y for each *m > *p. This 
statement is sufficient to state that st( *S( *m, ft m )) — » L. (Note: Technically, the m that appears 
in the sequence notation ft m should be considered as a member of CT IN. But, this does not come 
from the *-transform of any standard sequence or any allowed formal statement using our simple 
language.) | 
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The use of a sequence such as Q m changes the double sequence S(m, n) into a nonstandard 
type of ordinary sequence. One of the major concerns for double sequences is their relation to the 
iterated limits, where I'll use abbreviated limit notation. There are, of course, standard theorems 
that relate convergence of double sequences to the convergence of iterated limits. 

Theorem 5.7. Let 5(777, n) — > L £ K. Then lim m (lim„ S(m, n)) — L iff lim„ 5(m, n) exists for 
each m€ I. 

Proof. The necessity is obvious. So, assume that lim„ S(m, n) = r m £ M for each m £ 
IN. Then, by ^-transform for each Q, £ M^, st( *5( *m, il)) = r m for each m £ IN. All we need 
to do is to consider any sequence Q m £ Woo, like the constant sequence tt m = Q, and obtain 
lim m (st( *5( *m, fl m )) = lim m r m — L by Theorem 5.6. | 

A theorem such as Theorem 5.7 holds with an interchange of the n and m symbols. Theorem 
5.7 gives a condition under which an iterated limit will converge to the limit of a converging double 
sequence. But, are there necessary and sufficient conditions that determine completely when the 
limit of a double sequence corresponds to the limit of both iterated limits? Of course, if there is, 
it probably is not obvious. We need something special to happen. Consider the limit statement for 
lim m 5(771, n), where lim m 5(m, n) £ M. Then lim m S(m, n) converges uniformly in n iff for each 
y £ IR + there exists some pel such that for each n £ M, and m, m' £ M, where m, m! > p it 
follows that \S(m, n) — S(m', n)\ < y. Thus, the p is such that the sequence S(m, n) seems to behave 
like an ordinary convergent sequence independent from the actual value of 77 £ M. Let's see if this 
notion has a somewhat simply nonstandard characteristic. Indeed, one that parallels the statement 
for a sequence being Cauchy. 

Theorem 5.8. Let S: M x M — > K. Then lim m S(m,n) converges uniformly in n iff 

*S(A,n) - *S(n,n) £ /j(0) 

for each A,f!£ and for each n £*M. 

Proof. For the necessity, simply consider the *-transform. Use the fact that from the definition 
*y is arbitrary, and then select particular A,f!£ M^. 

For the sufficiency, assume that for each 77 £ * IN, *S(A,n) — *S(ft,n) £ /i(0) for each pair 
A, ft £ INqo. Let y £ TR + . Notice that, for a particular A, 51, there's a T £ such that A, 17 > T 
and I *5(A, 77) - *S(fl, n)\ < *y. Thus, the sentence 

3x((x £ M) A VyVzVw((y > x) A (z > x) A (w £ W) -> (|5(y,w) - *S*(z,w)| < *y))) 

holds in *M and, hence, in A4 by reverse ^-transform. This completes the proof. | 

Corollary 5.9. If for each n £ M, lim m 5(777, 77) = S n £ TR, then lim m S(m, n) converges 
uniformly in n iff for each n £ *TN, *S(A,n) — *S(n) £ ri(0) for each A £ M^. 

Proof. This follows immediately from Theorem 4.15, the Cauchy Criterion for convergence. | 

There is a standard necessary and sufficient condition for the equality of the limit of the double 
sequence and its iterated limits . 
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Theorem 5.10. Let S: IN x IN — > IR. Then lim S(m,n) = lim TO (lim„ S(m, n)) = 
lim„(lim m S(m, n)) G IR iff 

(i) linim ^(to, n) converges uniformly in n and 

(ii) lim„ S(m, n) converges for each n G IN. 

Proof. For the necessity, it's clear that (ii) follows from the convergence of the iterated limit. 
Then lim„(lim m S(m, n)) G IR, implies that lim m S(m, n) G IR, for each n G IN. Hence, by the Theorem 
4.15, *S(A,n) - *S(Q,n) G ^(0) for each n e a M and each A, O G Moo. By Theorem 5.4, we also 
have that *S(A,T) - *S(Sl,T) G /z(0) for each T G Moo- Hence, *S(A,n) - *S(Q,n) G /z(0) for each 
n G *M. Therefore, Theorem 5.8 yields that lim m 5(m, n) converges uniformly in n. 

For the sufficiency, let lim TO S(m, n) — S n = for each n G IN. Uniformly in n means 
that this limit is independent from the n used. By ^-transform, we have that for any SI G 
Moo, lim m (st(*5(*m,0)) = st(*5(fi)). Thus, for each A G M^, st(*S(A,fi)) = st(*S'(fi)). 
From (ii), we have that, in like manner, st( *5(A, D,)) = st(*5(A, T)) for each T,fl G Moo- 
Hence, *5(r) - *S(fl) G /x(0). Thus, *5(A, T) - »S(r) G /x(0), *S(A,fi) - G /i(0), which 

implies that *S"(A,0) - *S , (A,T) G /z(0) for all A,0,A,T G M^. Hence, limS(m,n) = L = 
lim ra (lim m 5(m, n)) — > L G IR by Theorem 5.4. Now apply Theorem 5.7 and the proof is com- 
plete. | 

Although Theorem 5.10 is a necessary and sufficient condition, it's often difficult to apply from 
the knowledge of the iterated limit behavior. There arc, as one would expect, special classes of 
double sequences where convergence of an iterated limit implies that the double limit converges. 
Many double sequences can be put into a form S(m, n): IN — + IR, where lim m S(m, n) — > for each 
n G IN and for each m G M, S(m, n) is decreasing [resp. increasing] in n. 

Theorem 5.11. Let S(m, n): IN x M — > IR and lim m S(m, n) = 0, for each n G IN and S(m, n) 
is decreasing [resp. increasing ] in n for each m G IN. Then S(m, n) — > 0. 

Proof. I show this for the decreasing case since the increasing case is established in like manner. 
We have that lim m S(m,n) = S'(n) = for each n G M. Thus, st(*S(A, *n)) = S'(n) = for 
each A G INqq. Now lim„S"(n) = implies since, S' is decreasing, that < S'(n) = st(*5(A, *n)) 
for each n G IN. Thus, in general, either *0 < *S(A,fi) for G Moo or *5(A,f2) G fi(0). But, if 
*0 < *S(A,tl) < *S(A, *n), then < st(*5(A,fi)) < st(*S(A, *n)) = 0. Hence, S(m,n) -» and 
the proof is complete. | 

The real numbers are complete. Hence, any nonempty set A C IR that is bounded above (i.e. 
there is some y G IR such that x < y for each x G A) has a least upper bound that is denoted 
by sup A. This means that sup A is an upper bound and if y G IR is an upper bound for A, then 
sup A < y. The greatest lower bound inf A exists for any nonempty B C IR that is bounded 
below. These ideas are applied to sequences that have convergent subsequences. Indeed, if 5 [IN] 
(the range of S) is bounded above [resp. below], them supS'fM] [resp. inf 5 [ IN]] is an accumulation 
point and there is a subsequence that converges to this point. (I'll show in the proof of Theorem 
5.14 (ii) a method that you can modify to establish this result.) 

Definition 5.12 (lim, inf, lim, sup.) Given the sequence S: IN — > IR. Let y G E iff there is a 
subsequence S' of S that converges to y. The lower limit (for S) lim inf S n = inf E, and the upper 
limit (for S) lim sup S n = supi?. 

Notice that if S n has no upper bound [resp. lower bound], then there is a subsequence S' n such 
that S' n — > +oo [resp. S' n — > — oo]. In order to consider subsequences that diverge in this ±oo special 
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sense, the two symbols — oo, +00 are included in the set E and if —00 £ E [resp. +00 £ E], then, 
by symbolic definition, let inf E = —00 [resp. sxvp E = +00], and no other cases need to be defined 
for sequences. We know that S has a subsequence that converges to L £ TR iff *S[EJ 00 ] n /x(L) 7^ 0. 
Further, the answer to the question 1 asked immediately after Theorem 4.14 is yes. So, because 
of this, the definition of st can be extended to the case where a subsequence diverges to ±00. For 
A £ BJoo, if *S(A) £ m± , let st( *5(A)) = ±00. By Theorem 4.10, the following result clearly holds. 

Theorem 5.13 Let S: M — > TR. Then liminf S n = inf{st( *5(A)) | A e TN^} and limsupS„ = 
sup{st(*S*(A)) I A G Woo}- 

Theorem 5.14 Let S: W -» TR. Then 

(i) liminf S n = —00 [resp. limsupSVi = +00] iff there exists some A £ such that 
*S(A) £ IR" [resp. TR+] iff *S[*Wn IR"] + [resp. TR+}; 

(ii) liminf S n = L £ TR [resp. limsupSVi] iff there exists some A £ INqo such that *S(A) £ 
H{L) {i.e. st(*5(A)) = L) and for each ft £ I M , *S(ft) £ fj,(L) or *S(Q) > *S(A) [resp. <]. 

Proof, (i) Let liminf S n — — 00. This implies that for each y £ TR~ and for each m £ IN there 
exists some n £ M such that n > m and S n < y. It should be obvious by now that such a statement 
means in our nonstandard structure that for any a £ TR^ and A £ INqq there is a ft £ M such that 
ft < A and, hence, ft £ such that *5(ft) < a. 

For the sufficiency, let y £ ]R~, m £ M. Then we know that if a £ TR^, then a < y. The 
hypothesis states that the sentence 

3x((xe *M) A(x> *m) A ( *5(x) < *y)) 

holds in *A4; and, hence, in M.. In like manner, for the sup. Thus (i) is established. 

(ii) Since liminf S n = L, the set E contains, at least one real number. I'll show that L £ E. 
What we do know is that there is a subsequence of S n that converges to some number > L. Hence, 
there exists A £ such that *S(A) £ G(0). Let P = {st( *S(a)) | (a £ M^) A ( *S(a) £ G(0)} ^ 0. 
Now liminf S n = inf P — L. From definition of "inf," if real r > L, then there is some p £ P such 
that <p - L < r - L. Thus, let < r - L = l/(2n), n £ IN, n^O. Then there is some p(n) £ P 
such that < p(n) — L < l/(2n). Since p(n) is the limit of a subsequence Q, then there exists some 
m £ IN such that \Q(m) — p(n)\ < l/(2n). Since Q n £ S[W] then by defining Q m — Q' n , we have 
that Q' is a subsequence of S such that \Q'(n) — L\ < l/n, for each nonzero n £ IN. Hence, Q' n — > L 
implies that L £ P. Of course, P — E, as E was previously defined. 

Now, there exists A £ such that *S(A) £ n(L). Assume that ft £ and *S(Q) ^ 
Since L ^ —00, then (i) implies that *S(Q) ^ TR^. Further, note that *<S(ft) ^ /u(r) for any real 
r < L, since if this was so than there would be a subsequence of S that converges to r and then this 
contradicts the notion of "inf." Thus, in this case, *S'(ft) > *S(A) (recall the monads are disjoint). 
The "sup" follows in like manner and the proof is complete. (The sufficiency is left to reader. )| 

Corollary 5.15. For a given S: M x IN — > TR, let E contain the limits of each converging 
subsequence. Then inf E £ E [resp. sup E £ E] . 

Proof. This is established in the above proof for real valued "inf" and "sup." Now obviously 
by definition, it also follows for the two defined cases of ±00. 
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Example 5.16. Let S: M -> m. 

(i) Define S n = (-1)"(1 + l/(n + 1)). Let AeI M bea *-odd number. Then *S(A) = 
-(1 + 1/(A + 1)) e Then taking a *-even 0, it's seen that *S(Q) E /u(l). Since for each n G IN, 
— 1 < S n < 1, we have that lim sup = 1, liminf S n = —1. From Theorem 5.14, we also know that 
for each T E that *S(T) E or *S(T) E or *5(A) < *S(r) or *S(r) < 

(ii) Let 5 n be the sequence of all the rational numbers. (Yes, technically there is such 
a sequence.) Then simply from noticing that there exist negative and positive infinite rational 
numbers, we have that liminf S n = — oo, limsupSVi = +oo. 

(iii) Let S n and Q n be any two sequences. Then 

liminf S n + liminf Q n < liminf (S n + Q n ) < 

lim sup(5„ + Q n ) < lim sup S n + lim inf Q n . 

Proof. Let A = {st(*S(A)) | A e Moo)}, B = {st(*Q(nj) \ A e Moo}- If A and B are both 
nonempty, then nonempty {st(*5(A)+ *Q(A)) = st( *S(A)) + st( *Q(A)) | A E M x } = A + B, where 
this "addition" definition is obvious. The result now follows from the "well known" result (taking 
into account the ±oo possibilities) that inf A + inf B < inf (A + B) < swp(A + B) < sup A + sup£>. 

(iv) Let S n — > L E TR and Q n be any sequence. Then lim inf (S n + Q n ) = L + 
liminf Q„, limsup(5„ + Q n ) = L + lim sup Q n - 

Proof. Let A = |st(%)(A)) | A E Moo}- By a trivial proof, when we use the symbols ±oo, 
we mean that they correspond to any member of IR^, it follows symbolically that for any a <G 
G(0), ±oo + a = ±oo. Recall how the definition of the st operator has been extended to ±oo. For 
any a£*l, st(a) = ±oo iff a E M^. Thus under this definition A ^ 0. This definition also satisfies 
the usual extend algebra for ±oo. Now we know that for each A <G Moo, *S(A) E f-i(L). Under this 
extended definition, it follows that for each !lel m st( *S(Q) + *Q(A)) = st( *S(Q)) + st("Q(A)) = 
L + st( *Q(A)). The result follows as in the proof of (ii), that inf A + L = inf B. The "sup" part 
follows in like manner. 

(v) Let S(m,n) — > L e IR. Then lim n (lim m inf S(m, n)) = lim n (lim TO sup S(m,n)) = 
lim m (lim„ inf S(m, n)) = lim m (lim„ sup S(m, n)) = L. 

Proof. Now this can be established by nonstandard means. But, it's immediate from the fact 
that S(m, n) — > L iff every subsequence S'(m, n) — > L by just considering n or m as fixed. | 
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6. BASIC INFINITE SERIES CONCEPTS 

Sometimes it's useful to simplified the notation for the finite and infinite series. Let A(n) = 
E&=o ak = So ak > t ncrc k E M. Then by definition this series converges to L iff A(n) — > L. 
Hence, all of our previous nonstandard characteristics for sequential convergence apply. For example, 
A(n) — > L iff *A(A) G for each A G Moo. Notationally, you will also see this written as 

Eo *dk ~ L = *L. These are called hyperfinite or *-finite summations. Indeed, any set such as 
{n | ( *0 < n < A) A (n G *M)}, where A G Moo is a *-finite set. The reason it's termed *-finite is 
that *-finite sets satisfy any of the finite set properties that can be presented in our formal language. 
To show that most of the basic manipulations done with a finite series hold for *-finite series, it's 
necessary to give a more formal definition for infinite series than is usually presented. 

Definition 6.1. Let a: M — > IR. Then the partial sum function A(k) is defined inductively. 

(i) Let A(0) = a ; 

(ii) then A(k + 1) = A(k) + a k+1 , fc G IN. 

(iii) Further, define for any n,m G W, n < m, A(n,m) — A(m) — A(n) = Yln+i ak ana - 
A(—l, 0) = a and if m = n ^ 0, then A(n, n) = 0. Notice that A(n — 1, n) = o„ in all cases. 

Observe A: U — > K. Thus, there's the nonstandard extension of this function to *A: M — > *B. 
Further, we know that for n < m, n, k € U, A(m) = A(n)+^4(n, m) = A(n, m)+A(n). This property 
also holds for A < 0, A, G Mqq. But not every ordinary mathematical process that can be done will 
hold in M. for *A. Whatever holds must be expressible in our formal language. This is not always 
possible. One thing that cannot be so expressed, generally, is the notion of "any rearrangement" of 
the members of a infinite series. What is needed is a specifically stated rearrangement. For each 
example, define Q n (k) — n — k for k G [0, n\. Now applying this to the finite sequence of terms for 
our finite series ao + • • • + a n yields bo = a n + ••• + &„ = ao . This can be viewed as a new sequence, 
and by ""-transform, it has meaning for any A G INoo • 

Because of how the "term generating" function is defined, it may be convenient to assume that 
the first few terms, say ao, a\, 02, ■ • ■ , a k , k < n, all equal zero. I assume that all sequences a k 
that are defined for n G IN, where n > k, are extended, if necessary, to sequences defined on BJ 
by letting Oj = 0, < i < k. (There will be times when this is not done and the notation will 
indicate this.) Moreover, it's also clear that removing finitely many terms from a series does not 
effect whether it converges or not. Thus, given original A: W — > IR, to determine whether A{n) 
converges you can use a different B: IN — > H obtained by letting b n = a k + n for any fixed k > and 
then investigate the convergence of B(n). Of course, you would need to adjust the two limits if they 
do converge. However, mostly, one is interested in the terms of a series, the a k . Further, note that, 
for A, n G Moo, A < Q, *A(A, SI) = *a k . 

Theorem 6.2 Let a: M — > IR. be a bounded sequence. Then there exists *M G a TR such that for 
each A, ft G Mco, A < ft 



n- a+ *i 



< *M. 



Proof. Since a k is bounded, then there exists some M G IR such that \a k \ < M. One of the most 
significant results for finite series is that for n < m, \ a k \ < \a k \. By defining the sequence 
b k = \a k \, then by ""-transform it follows that for A, ft G M^, A < Q, \ J2a 
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E a | *dk | ■ Now for the standard series since | ak | < M, then E™ a k < M(m — n+l). By *-transform, 
it follows that for A, e Moo, A < fl, \ Ea < *M(Q - A + * 1) and the result follows. | 

I restate some of the previous nonstandard sequence results that now characterize convergence 
of the series A(n). 

Theorem 6.3. The series A{n) L iff *A(A) - L g (jl(0), for each A g Moo iff *A(A) g (jl(L), 
for each A g Moo iff st(Eo a fc) = ^ f or each ^ e iff *A[M ryo ] C 

Theorem 6.4. (i) A(n) -» L iff for each A,fle Moo, A < fi, Ea a fc e m(0) iff *A(A, 0) g /z(0). 
(ii) If A(n) — > L, t/ien an € ^(0), /or eacft O € Woo. 

Proof, (i) First, note that if A = SI, then *A(A, Q) = and - 1, fi) = *A(Sl) - *A(Q - 1) = 
En°fc = a n . If A < SI, then *A(A, CI) = *A(fl)-*A(A) = Ea+i a k- If A > fl, then *A(fl)- *A(A) = 
— Ea+i a k- The result, in general, comes from the Cauchy Criterion and, clearly, we may assume 
that A < f2. (ii) This is immediate. | 

Although it's not required in our investigations, the converse of Theorem 6.4 (ii) holds for 
certain series. Recall what Godel wrote, that removing one quantifiers from a characterization is 
significance. So far, the nonstandard characterization do just this and often remove all quantifiers. 

Theorem 6.5. If A(n) — ► L, then a n — > 0. 
Proof. From Theorem 6.4 (ii). | 

Example 6.6. 

(i) Lct ^ = ( fc+ i)( fc+ 2) - Lct A e Moo. Then *A(A) = £o (fc+1) 1 (fc+2) = Eo ITT ~ £o F+2 = 
1 + Ei" fe^x — Ei" l+T — A+2 = 1 + — ^2 ^ ^-transform of the finite case and I have applied the 
convention of writing *x = x for *x g CT ]R. But, € A*(0). Hence A(n) — > 1 or, as is often written, 

E[T a fe = L 

(ii) Let A(x) = Eo° x?C ' # ^ 1- We know that, in general, ak(x) = . Hence, *oa = 

1 ~^ A x +1 = + for x g CT ]R. If |z| < 1, then g /u(0) implies that *a A (x) g Mr=^)- 0n tnc 
other hand, if \x\ > 1, then — ^ G(0), for A g M^. Thus, the series diverges. 

When compared with a general series, it's often easier to show that a non-negative type series 
converges or diverges. A series A(n) is non-negative iff there is some m £ I such that ak > for 
each k >m. 

Theorem 6.7. A non-negative A(n) converges iff there is some A g Moo such that *^4(A) g 
G(0) iff *A[*M] C G(0). 

Proof. There is some m £ I such that ak > for all k > m. Thus we write A(n) in terms of a 
new sequence B(n), where A(n) = B + £>(«) and is an increasing sequence. The result follows 
from Theorems 4.2 and 4.7 and the fact that *B(A) g G(0) iff S + *B(A) g G(0) for any Bel 
And, an increasing sequence converges iff it is bounded. The result follows for A is bounded iff B is 
bounded. | 

Theorem 6.8. A non-negative A(n) diverges iff there is some A g Moo such that *^4(A) g" G(0) 
iff *A(*W) <t G(0). 

There are standard arithmetical results that can aid in determining convergence. 
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Theorem 6.9. Given A: M — > M. Let /: IN — > IN have the property that, for each A(f(n + 1)) — 
A{f(n)) > b n for each n > j [resp. <]. Then for each n € IN, n > j 

n 

A(f(n + 1)) > A(f(j)) + bk, [rcsp. < ]. 

j 

Proof. [For >.] For n — j, clearly, the result holds. So, assume it for m > j. Then consider 
m + 1. Since A(f(m + 2)) - A(f(m + 1)) > b m+1 , then A(f(m + 2)) > A(f(m + 1)) + b m+1 > 
A(fU)) + E; b n + b m+1 = A(f(j)) + b k the result holds by induction. | 

Example 6.10. Let's determine convergence or divergence directly for two very well know 
series. In all cases, we extend any series to include the necessary zero terms although they might be 
directly mentioned. 

(i) Consider bk = 1/fc, k > 0. We look at the series ao — 0, a k = 1/k, k > 1. Define /: IN — > IN 
by letting f(n) = 2™. Then, for n > 1, 

2 n+l o„ 

A(2™ +1 ) - A(2 n ) = V - = V > ———t- = 1/2. 

v > \ ) fc 2™ + k ~ 2 n+1 7 

Applying Theorem 6.9, A(2 n+1 ) > A(l) + £"(1/2) = V 2 + n / 2 - Consequently, for A e M^, 
A(2 A+1 ) e Moo and the series diverges. 

(ii) Now let's look at famous "p" series, where bk — l/k p , k > 0. (a) First, let p > 1 and look at 
the series a = and a k = l/k p , k > 1. As done for (i) A(2 n+1 ) - A(2 n ) = J^T ( 2 "+fc)p ■ 1 notc that 
each term of this sum is less than 2~ pn and there are 2" terms. Hence (2"+fc)p < implies 
from Theorem 6.9 that 

™ 9 fe 

0<^(2" +1 )<l/2P + ^^. 

But, since p > 1, *A(2 A+1 ) e G(0) and the non-negative series converges. 

Now, for < p < 1, each term of the finite sum ^ 2n + k y, > 2 (»+i) P ■ Hence, as done above the 

A(2 n+1 ) - A(2 n ) = V > , 2 " > — . 

v ; y J (2™ + k)P ~ 2(™ +1 )p ~~ 2 p 

Consequently from Theorem 6.9, A(2 n+1 ) > 1/2 P + ^ and for this non-negative series *A(2 X+1 ) ^ 
G(0) and the series diverges. It obviously diverges for all p < 0. 

All of the standard converges or divergence tests can be translated into appropriate nonstandard 
statements. However, here is an interesting nonstandard comparison test. 

Theorem 6.11. Let Eo° a *= be a non-negative. If non-negative B(n) converges and there is 
some c G *TR such that < c and c e G(0) and, for each A e IN,*,, *a(A) < c(*6(A)), then A(n) 
converges. 

Proof. Assume that B(n) converges. Then for each A, ft S INqq such that A < O, Y^a *bk € 
/u(0). Also there is some r e IR + such that c < *r. Hence *0 < *a\ < c(*6(A)) < *r(*6(A)), for 
each A € INqq. By *-transform of the finite case, this implies that < J2a * a k < Ea * r i*bk) = 
*rJ^A *b k € ^(0). This completes the proof. | 
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Theorem 6.12. If non-negative bk diverges and there exists c > 0, c G *M — /x(0) and 
*a(A) > c( *6(A)) /or eac/i A £ M^, £/ien X)o° flfc diverges. 

Proof. Since bk diverges, then there exist A, Q, A < £1 and *bk </ /"(O). We also know 
that there exists some r £ TR + such that *r < c. Therefore *rJ2\ *^fe = Z)a * r *^fe ^ m(0)- Hence, 
since J^a * a fc ^ * r *^fc > 0, then *a fc </ /z(0) and the result follows. | 

Example 6.13. Assume that a n u k converges for li^fl. Then akx k converges abso- 
lutely for each x such that |a;| < \u\. 

Proof. Let b = < 1. Hence, the geometric series b k converges for such an x. Let 

A e DJqo Then *a(A)u A e m(0), since J^o^ a n u k converges. Notice that 



*a(A)a/ 



= | *a(A)u A |6 A < 6 A , 



since | *a(A)w A | < 1. You can apply Theorem 6.11, where c = 1. 

In the Chapter "Series of nonnegative terms" (1964, p. 55) W. Rudin states that "One might 
thus be led to the conjecture that there is a limiting situation of some sort, a 'boundary' with all 
the convergent series on one side, all the divergent series on the other side - at least as far as a series 
with monotonic coefficients are concerned. This notion of 'boundary' is of course quite vague. The 
point we wish to make is this: No Matter how we make this notion precise, the conjecture is false." 
However, Rudin's statement using the phrase "No matter how" in this section on non-negative series 
is itself false. Theorems 6.7 and 6.8 show that G(0) is just such a "boundary." 

Here is another example of the usefulness of the nonstandard methods and direct proofs. 

Example 6.14. Let each at > 0, X)o° a *: converge and at+i < afe for all HI. Then 
lim na n = 0. 

Proof. Let A G M^. For each non- negative real number r, there exists a unique natural 
number [r] such that [r] < r < [r] + 1. This statement and property can be written in our formal 
language. Thus by ^-transform, since A/2 <G 3Roo, there exists a > 0, a e *M and a — [A/2]. 
Now < [A/2] - A/2 < 1. Hence, necessarily, a = Q = [A/2] e Moo- Then *A(A) - *A(fi) > 
(A - fi) *a(A) > (A/2) *a(A) > *0 since such a statement holds in m. Thus, A*a(A) E /j,(0) and the 
result follows. 
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7. AN ADVANCED INFINITE SERIES CONCEPT 

Some of the most interesting aspects of the nonstandard theory of infinite series are developed 
when various infinite series product notions are probed. But we need the following result Abel's 
summation by parts. 

Theorem 7.1. Let series A: IN — > 1R, B: BJ — » TR. Then for each p,q E M, p < q 

q q q q-1 

a kh = ]T(A(fc) - A(k - l))6 fc = A ( k ) b k - J2 A ( k )bk+i = 
V v v p— 1 



j2 A(k)(b k - 6 fc+ i) - 4(p - l)6p + 



w/iere A_i = 0. 

Theorem 7.2. For eac/i A, O e Moo, A < 
n n 

Y*ak*b k = J2 A (k)(*h- *6fc+i)- *A(A-1)*6(A) + *A(0)* 6(0 + 1). 

A A 

Proof. By ^-transform. | 

One can immediately induce upon the right-hand side of the equation in Theorem 7.2 various 
requirements that will force it to be an infinitesimal. This will be seen in the proof of Theorem 7.3. 
But, first, notice that for the collapsing series EcT^fc — b k+ i), we have for each A,Oe T^oo, A < O, if 
|Ea *b k -*b k+1 \ = |*6(A)- * 6(0 + l)| < £aI*&*- *b k+ i\ € p(0), then *6(A) - *6(Q + 1) e M (0), 
for each such O and ft. 

Theorem 7.3. IfY^{bk — b k +i) converges absolutely and A: BJ — > IR. is bounded, then Eo° &kb k 
converges. 

Proof. From Theorem 7.2, 

n n 

|^*a fe *6 fe |<^|A(fc)(*6 fe - *& fe+1 )| + |- M(A-1)*6(A) + M(0) *6(Q + 1)|. 

A A 

Since there is some r € 3R + such that | *A(r)| < r for each T e M^, then 

|^*a fc *6,|<r^^|*6 fe - *6 fc+1 |^J + |* 6(0 + 1) - *6(A - 1)|^ . 

Since ^^°(6 fe -6 fe+ i) converges absolutely, then | *6(0 + 1) - *6(A - 1)| = IEa-i(*^- * b k+i)\ < 
Ea-i I * b k - *b k +i\ € m(0), Ea I * & fe - * b k+i\ G M(0) and the result follows. | 

Corollary 7.4. Let A M — > M 6e bounded. //Eo°(^ fe — ^fe+i) converges and {b k } is decreasing, 
then Eo° a *^fc converges. 

Proof. Observe that for each k e U, 6^ — 6fe + i > and, hence, Eo°(^ — &fc+i) ^ s absolutely 
convergent. The result follows from the previous theorem. | 

Now let's complete this chapter by investigating the "Cauchy product" and show how nonstan- 
dard methods aid intuition. I'll "play around" with the "subscript" notation somewhat and one 
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needs to understand what the double summation symbol is actually trying to indicate. The inner 
most of the two summations symbols will always indicate a "finite" summation where the index 
limit symbol is considered as fixed. Thus the notation E&=o (S^=o a J^ fc -i) mcans that you fixed 
each k, < k and obtain the value of the finite sum Ej=o a jbk-j- Then add all of the n+1 results 
together to get the double summation. I won't go through what some consider to be an "easy" proof 
that for non-trivial n > 1. 

C In 



(n \ / ra \ n Ik \ n-1 k \ 

ak H bk = H a 3 hk -3 + J2 \J2 an-k+jb-a-j . (7.5) 

/ V / k=0 \j=0 J fe=0 \3=0 J 

In the above expansion, the double sum indicated by the C is the most significant. Indeed, let 
c k = Y^j=Q a jbk-j- This is often called the Cauchy product. Then you have the sequence (i.e. 

series) C(n) = J2o c k = ELo (Ej=o a j b k-i) ■ 

Theorem 7.6. Let A{n) — > L a and B(n) — > L b . Then C(n) — » L a L{, zjff, /or any f2 G 

Woo, E*=d (E -=o - k + j) *b(n - 3)) G m(o) 

Proof. From the hypotheses, ^Eo * a fc)) ^ M-^a) an d (Eo *^fc)) ^ t JL iJ J b) f° r an y 
SI G Moo. Hence, * a fe) (Eo *^) e K L aLb)- Now Eo c fc = EaLo (Ej=o * a i* b k-j S J ■ But, 

Eto (E-=o X- e iff Efe=o (E-=o Ma k + j) *b(n j)) g m (o). i 

Although Theorem 7.6 indicates what portion of the right-hand side of equation (7.5) must be 
infinitesimal for the Cauchy product to equal the product of the limits of two converging series, 
this characterization is not the most useful. Using our previous notation, consider the sequences A 
and B and C. Suppose that B(n) — > Lb. You should be able to show that for all n G IN, C(n) = 
A(n)Lb + Eo a k(B(n — k) — Lb). What is needed in the next few theorems is the notion of the 
maximum member of any nonempty finite set determined by a given sequence Q: U — > IR. The 
following sentence holds in M. 

VxVy((x G M) A (y G W) A (x < y) -» 3z((z G M) A (x < z < y)A 



Vw((w G M) A (x < w < y) -» Q(w) < Q(z)))) (7.7) 

(Recall that Q(x) < Q(z) is but a short-hand notation for (Q(x),Q(y)) being in the < binary 
relation.) For any two i,j G IN, i < j, the Q(w) is called the maximum value in the nonempty 
finite set {Q(x) \ i < x < j}. It's denoted by ma,x{Q(x) | i < x < j}. Further, under *-transform 
such a member of *IR exists for A, Jl G Moo, A < Q. We use this to establish the following theorem. 

Theorem 7.8. Let B{n) — ► Lb. If A(n) — > L a absolutely, then C(n) — > L a Lb 

Proof. Since _B(n) - L b —> 0, then *B(n) - * L b G G(0) for each n G *IN. Also for each r G m+ 

there is some m G IN such that for each n > m, n G *IN, | *B(n) — < * r - Let L a = \a-k\- 

For any G BJoo, consider (in simplified notation) 

n n-(m+i) 
= | ^ *a fc (*B(r!-fc)-L fc )| < \*a k \\*B{fl-k)-L b \ + 

o o 
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n 

\*a k \\*B(fl-k)-L b \<rL a + 

n — m, 

(m + 1) max{| *a k \ \ *B(tt - k) - L b \ \ fl - m < k < 11} = rL a + (m + l)e, 

where e e p(0), for A - m < k < A implies that | *a k \ E p(0), which implies that {| *a k \ \ *B(Cl - k) - 
Lb\ | 17 — to < fc < 17} C /i(0). (I have used the *-transform of (7.7).) But, r is an arbitrary member 
of H + implies that A\ e /j(0) and the result follows from Theorem 7.6. | 

What if A(n) — > L a , _B(n) — ► C(n) — > L c , then does it follow that L c = L a L\P. In order 
to establish this, I establish, by nonstandard means, two special theorems that are useful for many 
purposes. 

Theorem 7.9. If S(n) — ► L, then lim„ fc = L = lim„ *=^qrp , w/iere = s^+i. 

Proof. For each r e m+, | *S'(A) - L\ < r since for each A e M^, *S'(A) - L e (i(0). So, 
consider arbitrary r e M+. Let 17 e Woo, p = [V^i] as dchned in Example 6.14. Then p <E Moo. 
Moreover, 1/17 < 1/p 2 . So, consider 

17 £1 p p 17 

- + ^-^max{|*S x -L| |p+l<x<17} 
p 12 

I apply my previous discussion on the "maximum" object that exist in any such *-finite set. Since 
(17 — p)/17 < 1 and all the objects in { *\S X — L\ | p + 1 < x < 17} are infinitesimals and r/p is an 
infinitesimal, then the result follows. | 

Theorem 7.10. If a n — > A, b n — » £>, i/iera 

lim Eo^- fc = ^ 

n n + 1 

Proof. For each (i.e. V), A € Moo, k <E M, let 

_ Eo Hk)*KA-k) = Eo *b(A-fc)(* a (fc)-^) AEo*&(A-fc) 
A A+l A+l A+l 

From convergence, for some M e E + , | *o(d — fc)| < M for each d £ *I, d > k. Hence, | *o(A — 
k)(*a(k) — A)\< M\(*a(k) - A)\, V A e Moo- From this and ^-transform of the finite sum case, 

< E A = ' ^ *^ - glMfe) - A)\ £ M Eo 1^) - di, y A e Moo , 

Since a n — > A, implies that |a„ — — > 0, then from Theorem 7.9, 



Therefore, _Ea € m(0), VA G Moo- Consequently, 



D A ^ E \ & | A 1 ^ G M(0) ' V A G Woc - 
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*b(A-k) 'b(k) 

Note that, in general, £o b k = Eo b n-k, Vnel. Thus, by Theorem 7.9, A+1 = ^ +1 G 

VA e Moo- Hence, Z? A - AS e /z(0), VA e INoo and the result follows. | 

Theorem 7.11. If A(n) -» L„, B(n) -» L 6 , C(n) = ELo (Ej=o «A-j) ~> toen 

Lc = La,L>b- 

Proof. Recall, that C(n) = A(n)Lt + Y^o a k{B(n — k) — Lb). This can be re-expressed as 
C(k) — E^=o a jB{j — k). Then by re-arrangement of the terms, it follows that, in general, 

n n 

Cn = A ( k ) B ( n - k ), Vnel. 
o o 

Hence, by the previous two theorems, 

l im ^ = X e = li m ^ A ^- fc ) = L L b 
n n+ 1 n n+1 

a the result follows. | 
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8. ADDITIONAL REAL NUMBER PROPERTIES 

Since this is supposed to be a monograph covering some of the basic notions in a first course in 
real analysis (i.e. calculus IV), then one should expect that certain additional real number properties 
need to be explored. This is especially the case if a slight generalization of the notion of continuity 
and the like is investigated. You will discover that, once again, the monad is the nonstandard 
"king," so to speak, in characterizing these concepts. There are slightly different definitions within 
the subject of "point-set topology" for the set-theoretic "accumulation point." I has chosen to use 
a definition that makes this notion equivalent to the previous sequence definition. 

Definition 8.1. Let Ac 1 Then p £ M is an accumulation point of (for) A iff, for every 
w £ TR + , the open interval (— w + p,p + w) fl A ^ 0. A point p £ IR is a cluster point iff for every 
w £ TR + , the deleted open interval = ((— w + p,p + w) — {p}) = (— io + p,p + w)' n A ^ iff 

(—w+p,p + w)'C\A = an infinite set. 

A cluster point is an accumulation point but not conversely. Consider the set A = [1, 2] n {3}. 
Then 3 is an accumulation point, and not a cluster point. Also each member of a nonempty A is an 
accumulation point. 

Definition 8.2. The set of all accumulation points is called the closure of the set A C IR. and 

is denoted by A or cL4. 

Note that A C cl(A). 

Definition 8.3. A point p £ A C H is an interior point of A iff there exists some w £ TR + 
such that (—to +p,p + w) £ A. 

Definition 8.4. A point p £ A £ TR is an isolated point of A iff there exists some w £ TR + 
such that (— w +p,p + to) n A = {p}. 

Notice that if S: IN — > TR, then p is an accumulation point for the sequence iff p is an accumulation 
point for the set S [IN] (i.c the range). It also follows that p is an accumulation point for A iff there's 
a sequence S of members of A such that S(n) — » p. Also a point p £ TR is an isolated iff it is an 
accumulation point and not a cluster point. This last statement characterizes the difference between 
the notions of the accumulation point and cluster point. Cluster points are accumulation points 
that are not isolated. Now how do monads characterize this set-theoretic notions? 

Theorem 8.5. Let A £ TR, p £ TR. Then 

(i) p is an accumulation point iff ri(p) fl *A ^ 0; 

(ii) p is an isolated point iff (i(p) n *A = {p}; 

(iii) p is a cluster point iff the deleted monad (i(p) — {p} = [i'(p) fl *A ^ iff fi(p) fl *A = 
an infinite set. 

Proof. These are rather easy to establish and, as usual, depend upon *-transform. (i) Let p £ TR 
be an accumulation point for A £ TR. Then the formal sentence, which I'm sure you can obtain form 
the informal, 

Vx((x e TR+) -> 3y(y £ A) A |y - p\ < x)) 

holds in M; and, hence in *M. So, let < e £ fJ,(0). Then there exists some a £ *A such that 
| a — p\ < e; which implies that a £ fi(p). 
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Conversely, assume that /j,(p) fl *A ^ 0. Obviously fi(p) C * ( — w + p,p + w), Vine TR + . Hence, 
letting b £ y(p) n *A and w; e IR+ the sentence 

3y((y e *A) a |y - *p| < %)) 

holds in *.M; and, hence, in AA by reverse *-transform and the conclusion follows. 

(ii) The sufficiency follows since p is an accumulation point. For the necessity, there exists 
some w e IR + such that (-w + p,p + w) n A = {p}. Hence, * ( - w + p,p + w) n *A = { *p} = {p}, 
under our notation simplification, and the result follows for p <G C * ( — w + p,p + w). 

(iii) This follows from the observation about accumulation points, cluster points and iso- 
lated points and the fact that the only standard number in /j,(p) is p. The second iff follows, for if 
otherwise there would be a "smallest" wi g IR + such that * ( — w\ +p,p + w\)' fl *A ^ 0. | 

Corollary 8.6. (i) A point p e IR is an accumulation point for A C IR iff there exists some 
a e *A such that st(a) = p. 

(ii) A point p g TR is a cluster point for A C IR iff there exists an a e *A — a A such that 
st(a) = p. 

For B C *IR, define the standard part of B as the set st(S) = {x \ (x g m) A ^(x) nB^}. 
Of course, you can consider st(i3) C CT IR. Notice that for any Ac 1, the standard part operator is 
defined, at the least, for all members of a A. Indeed, our definitions and characterizations for these 
set-theoretic notions are only in terms of monads about standard points. 

Theorem 8.7. Let A C IR. Then st(A) = clA. 

Theorem 8.8. A point p g IR is an interior point iff fi(p) C *A. 

Proof. I'm sure you can show that ri(p) = f]{ *( — w+p,p + w) \ w £ IR + }. Hence, the necessity 
follows. 

For the sufficiency, assume that p is not a member of the interior of A. Then for each w g 
m+, (-w+p,p + w)n(TR- A) ^ 0. Thus,p g cl(SR-A) and ri{p)C\ *(TR-A) = (j,(p)C\(*TR- *A) ^ 
implies that fi(p) <£ *A and the proof is complete. | 

Definition 8.9. Let A C IR. Then the derived set A' for A is the set of all cluster points. Notice 
that the derived set contains no isolated points. Example, let A = (1,2) fl {3}. Then A' = [1,2]. 

Theorem 8.10. For A C IR, the set A' = st( *A — a A), (not using the extended definition for 

st,). 

Proof. Theorem 8.5 (ii). | 

Theorem 8.11. For A C IR, the set of all isolated point is A - st( *A - a A). 

Proof. An isolated point p for A is a member of A, and such a p is isolated iff fj,(p) fl *A = {p} 
iff fi'(p) n *A = iff p g A - st( *A - a A) (or in simplified notation) iff p g A - st( *A - A). | 

A set A C IR is closed A = clA = st( *A). The set is open iff /j,(p) C *A, V p g A. Please note 
that 0, IR are open and closed. (Actually, this is not the standard definition for an open nonempty 
set. But, I leave it to you to show that this is equivalent to the statement that for each p g A, there 
exists a w p g IR + such that (—w + p p ,p + w p ) C A. Also A is perfect if it is closed and has no 
isolated points. 
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Theorem 8.12 A set A C H is perfect iff A = A'. 

Proof. Please note that c\A = A U A' Hence, a set is closed iff A' C A. For the necessity, 
A' = st(*A- a A) = st(*A- a A)uA = st( *A- ° A) U st(<M) = st(( *A - A) U = st( CT A) = A. 

The sufficiency is clear and this completes the proof. | 

Much of our interest will be restricted to the derived set. The reason for this is that for every 
p e A' there is a sequence S: M — > p such that p ^ S[M}. Please consider the following remarkably 
short proof of the Bolzano- Weierstrass theorem. 

Theorem 8.13. If bounded and infinite A C 3R, then st(*A — a A) ^ (i.e. A has a cluster 
point). 

Proof. Since A is infinite then *A — a A ^ 0. Since A is bounded that *A C G(0). Thus, 
*A — a A C G(0) implies that st( *A - 17 A) ^ and this completes the proof. | 

One of the most important topological concepts used throughout analysis is the notion of 
"compactness." Numerous equivalent definitions for this concept exist in the literature. I select the 
most important for our purposes. Intuitively, compactness should mean "closely packet" or "close 
together" but it's different from the notion of density since density is usually a comparison between 
two different sets. Often this intuitive understanding for "compactness" is not achieved from the 
definition. I'll give a nonstandard definition that yields this intuitive notion and then show that it's 
equivalent to one of the usual definitions. 

Definition 8.14. A set i C 1 is compact iff for each b e *A there is some p E A such that 
b G fi(p) (i.e. b w p) iff *A C \J{ti(p) \ P € A} iff each be *A is near-standard (meaning w to a 
member p e A.) The set \J{n{p) | p € A} is often denoted by ns(A) (the set of all near-standard 
points). 

Our next, and what is a major, result requires what appears to be a rather long proof. I have 
not introduced the idea of the (^-incomplete ultrafiltcr and concurrent relations. For the ultrafiltcrs 
I am considering and due to real number property discussed in the next paragraph, the sufficiency 
part of the next theorem can be established in but a few lines using a concurrent relation. In general, 
this result holds for topological spaces, using a concurrent relation, if a special type of ultrafilter is 
used (Herrmann 1991). 

A set Q of nonempty open sets is said to cover of (for) Ac TR iff A C |J{^ I ^ One 
standard definition for "compactness" says, that A is compact iff for every open cover Q there exists 
a finite subset (a subcover) Qf C G such that Gf covers A. A set A is said to be countable iff either 
A or there exists a one-to-one correspondence from IN onto A. The countably compact sets are 
those that have this covering property but only for countable open covers. For the real numbers, 
due mainly to the fact that the rational numbers are dense in the reals and for every real < r < 1 
there is a natural number n such that r < 1/n < 1, if nonempty G is an open set, then there exists 
a rational number w G B + and a rational number r e B such that p £ I = (—w + r,r + w) C G. 
Thus, every open cover G of A can be replaced by a countable open cover of such open intervals 
and such that A C U{^} C [J{G | G € G}, where each member of G contains, at least, one member 
of {Ii}. Hence, replace the covering definition for compactness with countable open covers by such 
a collection of open sets. 
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Theorem 8.15. Let nonempty set A C IR. Then *A C UIMp) I P *= ^4} «if every countable 
open cover {/j} /or A /ias a finite subcover. 

Proof. Assume that A satisfies the countable covering definition for compactness but that 
*A (f. U{a*(p) I P € A}. There exists some a G *A such that a ^ for any p G CT 1R. Consequently, 
there is some open interval I(p) with rational end points about some rational number such that 
a £ * I p and p G I(p)- Let ^ be a set of all such intervals I(p). Then Q is a countable cover of A 
and there should exists a finite subcover, say {I(pi), . . . , I(p n )}, such that A C i"(pi) U • • • U L{p n ). 
Consequently, *A C U • • • U *I(p n ). Hence, we have the contradiction that a G *I{p%) for 

some i = 1, ... ,n. 

For the sufficiency, just assume that there is a countable open cover Q of A which has no finite 
subcover. Our basic aim is to construct by induction from Q another cover and do it in such a manner 
that a sequence of members of A exists which, when viewed from the * IR and with respect to any 
free ultrafilter U, the equivalence class containing this sequence is not near to any member of A. 
First, consider the nonempty countable set G' = {Ci \ i = 1, 2, . . .} = {xC\A | (a; G G) A (xtlA ^ 0)}. 
Let Dq = C\. Now, let m\ be the smallest natural number greater than 1 such that C mi <f_ C\. This 
unique number exists since {Ci} cannot be a cover of A for C\ C C for some C G Q. Assume that 
the Dfc have been defined. Let m,k+i be the smallest natural number great than m k such that 

Cm k + 1 £\J{Di\i = l,...,k}. 

These unique natural numbers continue to exist since A is not covered by any finite subset of sets 
in Q. Now define Dk+i = C mk+1 . The sets D n , Vn e I are defined by induction 

Let Gi = {D n | n G U}. Since Q is a countable cover of A, then Gi is a countable cover, although 
not generally an open cover. Further, Q\ has no finite subcover. By definition D ^ and 

D n -\J{D k | fc = O,...,n-l}^0 

for each positive n G M since D n <£_ {J{Dk \ k = 0, . . . , n — 1}. Thus, define po to be any point in D 
and for each positive n G 3N, define p n to be any point in D n — [J{Dk \ k = 0, . . . , n — 1}. (Did I use 
the Axiom of Choice or can this be considered an induction definition?) Thus, there is this sequence 
P: M — > A such that P(n) = p n . If the natural number m > n, then p m ^ Di, i = 0, . . . ,n. Thus, 
if p m G -Dfe for any k = 0, . . . , n, then m < n. This means that for each n G IN the set of natural 
numbers {x \ (x G DJ)A(P(x) G £>„} is finite. Hence, for eachn G IN, {x \ (x G M)A(P(x) ^ £)„} € U 
for any free ultrafilter U. This yields, in general, that [P] ^ *D„ for each n G IN and [P] G *A For 
-Dfe G Gi, there exists some Ck & G such that = Aflq. Let £2 be the set of all such Cfe. Since 
[P] ^ *D„ for n G IN, then [P] ^ *c n . But, the set Gi is an open cover of A. Thus, for each p e A, 
there is some G Gi such that C *Cfe. Consequently, [P] ^ U{m(p) I P € ^} an d the proof is 
complete. | 

Next, I present nonstandard proofs of a few additional characteristics for compactness, where 
trivially a finite A C IR is compact. 

Theorem 8.16. A nonempty A C IR is compact iff it is closed and bounded. 

Proof. Assume that A is compact. Since *A C UIMp) \ P ^ A} C G(0), the A is bounded. 
Now let (j,(q) nM/0. Then fi(q) n ii(p) ^ for some p G A. Hence, g = p. Thus, A = st( *A) and 
A is closed. 
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Conversely, let A be bounded. Then *A C U{m( x ) \ x e M} = G(0). Also, A ^ TR. Let any 
q E 3R - A. Since A is closed, then ri(q) n *A = 0. Thus */4 C LKMp) I P € A} and this completes 
the proof. | 

Theorem 8.17. Let infinite AcE TAeri A is compact iff each infinite B <Z A has a cluster 
point in A. 

Proof. Since A is compact, then A is bounded and, hence, B is bounded. Thus, by 8.13, B has 
a cluster point p. But, since A is closed, then p e A. 

For the sufficiency, assume that A is not compact. Then either A is not bounded or A is not 
closed. Assume that *A (£ G(0). Let r = 1. Consider the case, that A is not bounded above. 
Then there's some pi € A such that pi > 1. Let r = p\ + 1. Then there exists some P2 G ^ 
such that p 2 > Pi + 1- Assume that we have defined pk- Then there is some Pk+i such that 
Pk+i > Pfe + 1 > Pfe-i + 1 > • • • > 1. Let po = 1. Thus, there is a sequence P: BJ — > B such that 
lim„p„ = +oo. Hence, this sequence has no accumulation point in A, which in this case is equivalent 
to not having a cluster point for the infinite P[M] C A. The case where A is not bounded below 
follows in like manner. 

Now suppose that A is not closed. Then there exists some q e A' — A. Hence, there is an infinite 
sequence of distinct members of A that converges to q. Again, q would be a cluster point for A. 
This completes the proof. | 
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9. BASIC CONTINUOUS FUNCTION CONCEPTS 

For all that follows in this chapter, D will denote the domain for the real valued 
function /. Recall that the notation /:£)—> IR. means that / is a real valued function defined on 
D. Of course, in this case, / is also defined on any nonempty subset of D. First, let's consider the 
idea of the limit of / as x — > s or lim^,, f(x) or, in abbreviated notation, lim s f(x) where I use s 
so as not to confuse this with the more general notation for the specific case where we look only at 
sequences and use n or m below the lim symbol. 

Recall that for f:D — > IR, lim s f(x) = L iff for every r £ IR + , there exists some w £ M + 
such that, whenever x £ D and < \x — s\ < w, then \f{x) — L\ < r. Clearly, s must be a 
cluster point for D. That is s £ D' for this notion to have a significant unique meaning. This 
is one of the first definitions that appears in a calculus book and that often gives students some 
difficulty in its application. But, as will be seen, the nonstandard characteristics, especially (i), 
for this limit concept are much easier to state and yield the actual intuitive idea. Recall that for 
each p £ TR ri'(p) = ii(p) — {p} is the deleted monad about p and if g: B — > *K, A C B, then 
g[A] = {g{x) | x £ A}. 

Theorem 9.1. Let f: D -> IR. Then lim s f(x) = L iff 

(i) *JV(s) n *D] c m iff 

(ii) for each q £ ft' (a) fl *D, st( *f(q)) = L iff 

(iii) for each nonzero e £ ^(0) such that s + e £ *D, then *f(s + e) — L £ /j,(0) iff 

(iv) for each e £ n(0) + and x £ *D such that < \x — s\ < e, then *f(x) — L £ /x(0). 

Proof, (i) For the necessity, let lim s f(x) = L and r £ K + . Then there exists some w £ M + 
such that the following sentence 

Vx((x £ D) A (0 < |x - s\ < w) -» (|/(x) -L\< r)) 

holds in AA; and , hence, in *AA. In particular, for each p £ /i'(s) n *D, \ *f(p) — L\ < r. Since r is 
an arbitrary positive real number, and we have that < \p — s\ < w for all w £ IR + , it follows that 
for each p £ fi'(s) n *D, \ *f(p) — L\ £ ri(0) or that *f(p) £ ri(L). 

For the sufficiency, assume that r £ TR + . There exists a q £ ^i'(s) C\ *D since s is a cluster 
point of D. Thus, q ^ s and, hence, there is some e £ /i'(0) such that q = s + e. Consequently, 
< \q - s\ = |e| £ ,u(0). If p £ *D such that < \p - s\ < |e|, then p £ fi'(s) n *D implies that 
*f(p) £ m( s )- Consequently, the sentence 

3x((x e m+) A Vy((y £ D) A (0 < |y - a\ < x) -» (|/(y) - L| < r)) 

holds in by reverse *-transform and this first "iff" is established. 

All but the last "iff" are immediately equivalent to this first one. The necessity of "iff" (iv) 
is clear. The sufficiency of (iv) follows from the above sentence for the sufficiency of (i) and this 
completes the proof. | 

Corollary 9.2. If\\m s f(x) = L, then L is unique. 

Corollary 9.3. IfTcD, s £ T' and lim s f(x) = L with respect to D, then lim s f(x) = L 
with respect to T. 
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Corollary 9.4. 7/lim s f(x) = L, then there exists a nonempty open set G such that s £ G and 
*/[( *G - {s}) n *D] C G(0). Note that {s} is not an open set. 

Theorem 9.5. lim s f(x) — L iff there exists a sequence S such that for each n £ M, S„ / 
s, S n £ D; S n — > s and lim„ /(5„) = L. 

Proof. Suppose that lim s f(x) = L and that S: M — > D, 5„ — > s and that for each nG I, ^ 
s. Then for each A £ BJoo, *S(A) £ /u(s), *S(A) 7^ s and *S*(A) e *L> implies that *S(A) e //(s)n *D. 
Hence, */(*5(A)) = *(fS)(A) £ //(L) and the necessity follows. 

For the sufficiency, assume that lim s /(a;) 7^ L. Then there exists some r £ E + such that 
for each to e IR. + whenever x £ D and < |x — s\ < w, it follows that \f(x) — L\ > r. Since 
f]{ *( — w + s, s) I w £ TR + } n then for each w = 1/n, / n e I, there exists a sequence 

such that S n 7^ s, S n £ -D, and < |5„ — s\ < 1/n and \f(S n ) — L\ > r. Consequently, S n — > s, but 
f(S n ) L and the proof is complete. | 

Of course, it's this "sequence" theorem that gives the major intuitive characteristic for such limits. 
Modifying the definition for lim s f(x) = L yields the one-sided limits. Recall that the 

modifications are /(s±) = L iff for each r £ IR + there exists a w £ M + such that, whenever 
f x — s *C ' / ' 

< „ , then I f(x) — L\ < r. For these limits, the monads need to be modified in the 

10 < s - x < w 1 w 1 

obvious manner. For each p £ M, let n(p) + = {x \ (x > p) A (x € £*(£>))} = {a; | (1 > p) A (1 ~ 
P)} = f){*(P,P + w) I «> G JR + L A*(p)~ = {x I (x < p) A (x G /i(p))} = {x I (x < p) A (x w p)} = 
f}{* ( — w + p,p) I u> e Using these positive or negative monads our previous theorems and 

corollaries all hold with the appropriate modifications. 

Theorem 9.6. Let f: D -» m. Then f{s±) = L iff 

(i) 7[m'(«) ± n *£>] c M (L) 

(ii) for each q £ //(s)± n *D, st( 7(g)) = L iff 

(iii) /or eac/i e e m(0) such that s + e £ *D, i/ierc 7( s + £ /x(0) i/f 

(iv) /or eac/i e G m(0) + a7 ^ x £ *D such that { q ^ ^ * ^ ^ 1 ^ en 70*0 — L £ p(0). 
Corollary 9.7. If f(s±) = L, then L is unique. 

Corollary 9.8. IfTcD, s e T and f(s±) = L with respect to D, then f(s±) = L with 
respect to T. 

( 1+ = ( s r) 

Corollary 9.9. If f(s±) = L, then there exists a nonempty open interval < _ \ ' ! such 
that 7[(*/ ± ) n *D] C G(0). Note that {s} is not an open set. 

The following is the appropriate modification for Theorem 9.5 

Theorem 9.10. Let f:D — > IR. Then f(s+) = L [resp. f(s—)] iff there is a sequence S such 
that for each n £ M, S n £ D, S n > s, [resp. S n < s], S n — > s and lim„ f(s n ) = L. 

Proof. I prove this for f(s-) since f(s+) is done in like manner. Let f(s-) = L and S n 
s, Vn £ M, S n ^ s, S n < s, S n £ D. Then VA £ I M , *S(A) £ n(s)~ and 5(A) < s. Thus 
7(*S(A)) = *(/5)(A) £ p(L) and the necessity follows. 

For the sufficiency, the method is similar to that for Theorem 9.5. Assume that f(s—) -/-> L. 
Then there exits some r £ ~5R + such that Vw £ TR + whenever < s — x < w, x £ D, then 
\f{x) — L\ > r. Since p{s)~ = f]{ * ( — w + s, s) | w £ TR + }, by ^-transform, for G I, there is 
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some so G ( — 1 + s , s ) l~l -D and So < s. Assume that for k £ M, fc > 1, there are Sfc G (— l/(fc + 
1) + s, s) n £>, and S fc < s. Now consider k + 1. Then since (-l/(fc + 2)+s,s)nO/f), there is 
some G (— l/(fc + 2) + s, s) n D and S/c+i < s. Thus yields a sequence S: M — > £) such that 

Vn e I, S n £ D, S n — > s and 5„ < s, but |/(SW) — L| > r and the proof is complete. | 

Theorem 9.11. Lei /: D — > B and s G int(D) ff/ie sei o/a/Z interior points). Then lim s /(a;) = 
L ifff(s±) = L. 

Proof. /i'(s) = /z(s)+ U /x(s) _ . 

Example 9.12. In the usual calculus text, it's established that lim = 1, by means 
of a geometric proof. Although, I won't mention any apparent geometric facts in the following 
nonstandard proof, it might be necessary to use the geometric definitions to establish the facts I do 
use. 

For each r £ M + such that < r < n/2, since sin(r) < r < tan(r), cos(r) < sin J r ) < i. Thus, 
for each e G ^(0) + , by *-transform, 

„ . , *sin(e) 
*cos(e) < — j-!- < 1. 

But, since | sin(r)| < |r|, Vr £ R, then | *sin(e)| < e implies that *sin(e) £ /i(0). This yields 

1- *(cos(e)) 2 = *(sin(e)) 2 £ M (0) 

which implies that *cos(e) £ p(l). Consequently, 1 < st(*cos(e)) < st( sl "^ ) < 1 for each e £ 
ri(0) + . Thus, s "ff + - ) = 1. To show that this last equation holds for e £ fj,(0)~ , simply notice that 
* sin l; e)) = ^2) Ve G /i'(0). Hence, the result follows. 

All of the usual limit and one-sided limit algebra for such functions follow from the properties 
of the standard part operator. Now let's establish the Cauchy Criterion for functions. 

Theorem 9.13. (Cauchy Criterion.) Let f:D — > TR. Then lim s f(x) = L iff for each pair 

p,qeii'{a)n *D, *f( P )- 7(g)e/x(0). 

Proof. The necessity follows from Theorem 9.1. 

For the sufficiency, assume that there does not exists w £ M + such that / is bounded on 
(— w + s, s + w)' n D. Hence, for r = 1, for each w £ TR + there are, at least, two distinct x\,X2 £ 
(—w + s,s + w)' n D, X\,X2 7^ s and \f(xi) — f(x 2 )\ > 1. Consequently the sentence 

Vx((x £ TR+ -» 3y3z((y G £>) A (z £ D) A (0 < \s - y| < x)A 

(0<| S -z|<x)A(|/(y)-|/(z))>l))) 

holds in *A4 by ^-transform. So, let e G ^(0) + . Then there exists distinct p, q such that < \s—p\ < e 
and < \s — q\ < e and | *f(p)- *f(q)\ > 1. But, this contradicts the requirement that *f(p)- *f(q) £ 
/x(0). Thus, there is some w £ IR + such that / is bounded on (—to + s, s + w)' n -D. Consequently, 
7[/i'(s) n *£>] C G(0). Letting q £ fj,'(s) n *L>, then for each p £ //(s) n *D, 7(p) G ,u(st( 7(g)) and 
the result follows where st( */(?)) = L. 

Corollary 9.14. Let f: D — > m. Then /(s±) = L i#/or eac/i pair p, g G m'( s ) ± f~l *D, */(p) - 

7(g) e /x(o). 
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Theorem 9.15. Let /: (a, 6) — > ~5R and a < c < d < b. If f is increasing [resp. decreasing], 

then 

f(c-) = sup{/(x) \a<x<c}< /(c) < /(of) = inf{/(x) | c < z < 6}. 
[resp. /(c+) = inf{/(x) | a < x < c} < /(c) < /(c-) = sup{/(x) | c < x < b}.} 

Further, /(c+) < /(d-) [resp. /(d-) < /(c+)]. 

Proof. I show this only for an increasing function /. Clearly, sup{/(x) | a < x < c} — L, L < 
/(c). For any real number r < L, there exists some p £ (a,c) such that r < f(p) < L. Thus, let 
e £ /u(0)~. Then a < c + e < c implies that r < */(c + e) < L since */ is increasing on *(a,b). 
Therefore, r < st(*/( c + e)) < L. Since r < L is arbitrary, this implies that st(*/(c + e)) = L for 
each such e, and this first part follows from Theorem 9.6. The inf case, follows in like manner. 

Now if c < d, then c + e < d + 7 for each e € A*(0) + and each 7 e /^(0)~. Consequently, 
st( */(c + e)) = ,/(c+) < /(d-) = st( */(d + 7)) and the proof is complete. | 

I guess I should mention the other ordinary limit of a function notion used when D is not 
bounded above or below, the 00. Recall that if D is not bounded above, then lim,^ f(x) = L iff 
for each r e E + there exists some w € TR + such that for each p € D such that p > w, \f(p) — L\ < w. 
For D that is not bound below, this limit notion is defined in the obvious manner. 

Theorem 9.16. Suppose that f: D £ E is not bounded above [resp. below}. Then lim^ f(x) = L 
iff *f(p) € n{L) for each p £ IR+ n *D [resp. *f(p) £ ii(L) for each p £ H~ n *D). 

Proof. Left to the reader. 

Theorem 9.17. Suppose that f:—* D £ TR is not bounded above [resp. below]. Then 
limoo f(x) = L iff for each pair p, q £ m+ n *D [resp. TR X n *D], *f(p) - *f(q) £ /i(0). 

Proof. Left to the reader. | 

Our major interest is to investigate properties of continuous real valued functions defined on 
D. Since for f:D—* TR to be continuous at s, all one needs is that lim s f(x) — f(s) and, hence, we 
need s £ D. 

Theorem 9.18. Let s £ int(D). Then function f:D —* JR is continuous at s ifflhn s f(x) — 
f(s) = f(s+) = f(s-). 

Proof. Note that ri(s) = ri(s)+ U {s} U ri(s)- . 

Each of the previous theorems on the left and right-hand limits, when slightly modified, hold 
for continuous functions. Also, each monadic characteristic for continuity holds for isolated points. 
Thus, s need not be a cluster point. The changes are made by replacing the deleted monads with 
the complete monad and such statements as < \x — s| by \x — s| and the like. The must used result 
is that / is continuous at p £ D iff *f[ri(p) n *D] C ri(f(p)). Now let's apply these results to obtain 
three highly significance continuous function properties. 

Theorem 9.19. Let continuous f:D^TR and let D be compact. Then the range, f[D], is 
compact. 

Proof. Since D is compact, then *D C LUMp) I P *= D}. But, using a property that holds for 
any function, it follows that 

*/[ •£>] c (J{ 7[Mp) n*D]\ P £D}£ \jMf(p)) I f(p) e f[D}} 
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and the result follows. | 

Theorem 9.20. (Extreme Value Theorem.) Let continuous f: D — > M and D be compact. Then 
there exists p m ,PM € D such that for each p € D, f(p m ) < /(p) < /(Pm)- 

Proof. Since /[£>] is compact, then it is closed and bounded. Thus, from boundedness sup{/(x) | 
x e D} = pm and inf{/(x) | x e D} = p m . Since /[D] is closed that p m , pm € Z? and the result 
follows. 

I mention that all such standard theorems can be extended to "nonstandard statements" by 
*-transform. To establish the intermediate value theorem the notion of connectedness is often 
introduced. But, rather than do this, I'll give a nonstandard proof where connectedness is not 
mentioned. 

Theorem 9.21. Let continuous f: [a, b] — > IR. The for each d such that f(a) < d < f(b) [resp. 
f(b) < d < f(a)], there is some c <G [a, 6] such that /(c) = d. 

Proof. The result is immediate if a = b. So, assume that a < b and consider the case where 
that f(a) < d < fib). Let nonzero n £ IN and h = (b — a)/n. Then we have a finite partition of 
[a, b] {a, a + h,a + 2ft, . . . , a + nh = b}. Thus, there exists some m G I such that m < n and (i) 
/(a) < f(a+mh) <d< f(a+(m+l)h) < f(b) or (ii) f(a) < /(a+(m+l)ft) < d < f(a+mh) < f(b). 
Assume (i). By *-transform, if A e Moo, then (b — a)/ A e £t(0). There exists some mi e 'I 
such that mi < A and f(a) < *f(a + mift) < d < *f(a + (mi + l)ft) < /(&). (Note the use of 
simplified notation for such things as f(a), where technically this should be written as a f(*a).) 
Since a < a + m\h < 6, then there is a real c = st(a + mi ft) and a < c < b From the continuity 
°f /> f( c ) = /( st ( a + mift)) = st(*/(« + mift)) < d for a + m\h e /u(c) n *[a, b]. However, 
a + mift + ft = a+(mi + l)ft G yu(c) implies that f(c) = f(st(a+ (mi)ft)) = st( *f(a + m 1 + l)hj) > d. 
Therefore, /(c) = rf. The other cases follow in a similar manner and the proof is complete. | 

The results that the sum and product function and similar processes defined for continuous func- 
tions yield continuous functions follows from the properties of the standard part operator. Our last 
result in this chapter is a nonstandard proof of the composition properties for continuous functions. 

Theorem 9.22. Let continuous f:D—* IR and continuous g:T — > R be such that f[D] C T. 
Then the composition gf: D — > IR is continuous. 

Proof. Let p e D. Then *f[(J,(p) n *D] C M/(p)) n *(/[-°]) C *(/[£>]) C *T imply that 
*ff[7[ftb) n *£>]] C * 9 [m(/(p)) n *T] C K9(f(p)) and the result follows. | 
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10. SLIGHTLY ADVANCED CONTINUOUS FUNCTION CONCEPTS 

Unless otherwise specified, for all the follows in this chapter, D will denote the 
domain for the real valued function /. Here is a result, you may never have seen before, 
that also implies the intermediate value theorem. The original standard proof and result is due to 
Bolzano. 

Theorem 10.1. For continuous /: [a, b] — ► IR, if f(a)f(b) < 0, then there exists some c e (a, 6) 
such that /(c) = 0. 

Proof. First note that the hypotheses require that a ^ b. Assume that /(c) ^ for each 
c e (a, b). Since f(a) ^ and f(b) ^ 0, then /(c) ^OVce [a,b]. I now show that for each nonzero 
m€ I (i.e. m e IN') that there exist real numbers s m , t m such that t m — s TO = (6 — a)/m and 

a < s m < t m < b, and {.^ m | < 0. 

f{s m ) 

For meBJ', consider the function 

, . f(x+(b-a)/m) to — 1 „ . 

/(X) TO 

Consequently, the product U™ -1 g(a + k(b — a)/m) = f(b)/f(a) < 0. Thus, there is some k e IN, < 
k < to - 1 such that g(a + k(b- a) /to) < 0. Hence, /(a + (k + l)(b - a) /to) /f(a + k(b- a) /to) < 0. 
Let s m = a + k(b — a)/m and t m = a + (fc + 1 ) (6 — a) /to and the conditions required hold for s m , t m . 
Thus, by ^-transform, if A e I M , there exists p,g£ *M such that q — p = (b — a) / A, a < p < q < b 
and *f(q)/*f(p) < 0. Since q— p e A*(0)> then p e /x(st(g)) Consequently, using the result that 
st(g) < b and the continuity of /, *f(p) e n(f(st(q))). Therefore, st(*/(p)) = /(st(p)) = /(st(g)) 
implies that /(st(p))//(st(<7)) = 1 < 0. This contradiction yields the result. | 

To obtain the immediate value theorem from Theorem 10.1, just consider for the function 
f(x) — rf, if f(a) < d < f(b), or d — f(x) if f(b) < d < f(b) for the non-trivial cases f(a) ^ d and 
f(b) ^ d. A major result characterizes continuity on the entire set D in terms of open sets. The 
proof is a little long due to the simplified structure I'm using. Let Q be a nonempty collection of 
open subsets of IR. Then since for each p £ [J{G \ G G G}, /i(p)c *Gfor some G € Q, then 
n(p) C 1J{ *G | G <G G} C * ( {J{G | G e G}) implies that the arbitrary union of a collection of open 
sets is an open set. 

Theorem 10.2. Let f:D^> TR. Then f is continuous on D iff for each open set G C 1, 
/ _1 [G] is open in D. 

Proof. Note that a set G\ is "open" in D iff there exists an open G2CK such that G\ = G2 H D. 
Assume that / is continuous on D. Let G be an open set in IR. If G = 0, then / _1 [G] = {p | (p € 
£>) A (/(p) E G} = 0, which is open in D. The same result would hold if G n f[D] = 0. Hence, 
assume that G n f[D] ^ 0. Then let f(p) e G n /[£»]. Since G is open, then /i(/(p)) C *G 
and, by continuity, */[m(p) H *D] C m(/(p)) n *D C *G. Since /u(p) is the intersection of all the 
intervals * ( — r + p, r + p), re TR + , then there exists some (— r + p,p + r), re ]R + , such that 
p e (— r + p,p + r) n D e / _1 [£)]. Since the arbitrary union of open sets is an open set, then using 
one of these open intervals for each p e D, one gets an open set Go C IR such that Go n D = / _1 [D]. 

For the sufficiency, I'll use inverse image, / _1 , set-algebra. Consider fi(f(p))C\ * (f[D]) — f]{ * (— 
r + f(p), f(p) + r I r e TR+} n *(/[£)]) implies that 7 _1 [M/(p)) n 7[^]] - 7" 1 [/"(/W)] n *D - 
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f]{*(f~ 1 [(- r + f(p)J(p)+ r )]^ D ) I r G Now b y th e hypothesis, each f- 1 [(-r + f(p), f(p) + 

r))] n D is open in D. Hence, for p £ r + /(p), /(p) + r)] n D, there is some s e 1R+ such that 

p £ (-s+p,p + s)nfl C / _1 [(-^ + /(p)./(p)+0)] n£> - However, /i(p)n *£> C *(-s+p,p + s)n *L> 
implies that /z(p) n *D c 7~ 1 [M/(p))] n *D. Therefore, 

7[m(p) n *D] c 77 -1 [M/(p))] n 7[*d] c M/(p)) n 7[*d] c M/(p))- 

The proof is complete. | 

Prior to considering the notion of uniform continuity, here is a nonstandard proof of a rather 
interesting result. A real valued function is additive if for each p,q £ IR, f(p + q) = /(a) + f(q). 
Recall that I'm using simplified notation in that rather than write a statement such as *x £ " IR, 
this is often written as x £ IR. 

Theoreml0.3. Let f: IR — > M be additive. If f is bounded on some non-empty interval I, then 
f(x) = xf(l) for each x £ B and is a continuous function on IR. 

Proof. Clearly, *f:*TR — > *IR is additive. Additivity implies that for any rational r and any 
x £ IR, f(rx) = rf(x). Hence, *f(rx) — r*f(x) for each x £ *IR and *-rational r £ *IR (i.e. r g *Q). 
Let p be in the interior of / (i.e fi(p) C */). Then from boundedness, |*/[m(p)]| < M £ a R. 
Consequently, for each e g M (0), | 7(p + e)| = | *f{p) + *f{e)\ < M implies that | */(e)| < M+\f(p)\. 
Now for each n £ M, ne £ /i(0) implies, by additivity, that \ *f(ne)\ = n\*f(e)\ < M + |/(p)|. 
Therefore, for n £ IN', | */(e)| < (M + |/(p)|)/n. This yields that 7(e) £ fj,(0), Ve g ^(0). From 
the density of the rational numbers in IR, for any r £ TR + and any x £ IR, there is some q £ Q such 
that \x — q\ < r. By *-transform, we have that for e £ /x(0) + and x £ IR, there is q £ *Q such that 
\x — q\ < e. Hence, x — q £ (i(0) implies that there is some 7 g ^t(O) such that x = q + 7. Therefore 

f(x) - 7(« + 7) = 7(9) + 7(7) = *f(q ■ 1) + 7(7) = <?(/(!)) + 7(7)- 

Thus, g(/(l)) g Finally, /(a:) = Bt(g(/(1))) = ( B t(«))st(/(1)) = xf(l) and, obviously, / 

is continuous. | 

Theorem 10.4. If r £ IR, then there exists a hyperrational r £ *Q and some e £ /j,(0) such 
that r = q + e. 

You might try showing from Theorem 10.4 that if /: IR — > IR and Vx,y g IR f(x + y) = 
f(x)f(y), *f[*Q] C G(0),limo/(aO - 0, then f(x) = 0, Vx g IR. 

Recall that /:£)—> IR is uniformly continuous on D if for each r £ IR + there exists some 
w £ IR+ such that whenever x,y £ D and \x — y\ < w, then \f(x) — f(y)\ < r. Of course, this concept 
is highly significant in series work and integration theory. The follow characteristic follows in the 
usual manner, where the big difference between this and Corollary 9.14, extended to continuity, is 
that the points are not restricted to a particular /i(s). 

Theorem 10.5. The function f:D^>TRis uniformly continuous on D iff for each p,q £ *D 
such that q — p £ /i(0), 7(p) — 7(?) € A*(0). 

Theorem 10.6. Let f:D^TRbe continuous on compact D. Then f is uniformly continuous. 

Proof. Let p,q £ *D and p — q £ /i(0). Since *D C U{m(p) I P •= D}, then p,q £ fi(s) for some 
s £ D. Thus, from continuity, *f(p), *f(q) £ A*(/(s)) implies that *f(p) - *f(q) £ ri(0) and the 
result follows. | 
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Since being uniformly continuous is so important within analysis, I'll present a few more perti- 
nent propositions. 

Theorem 10.7. For real numbers a < b, let f: (a,b) — > TR. If ft € fi(b)~ fl (a, b) ( = /x(6)~) 
and *f(h) G(0), then for each r G TR + and each p G (a, b) there exists some q G (a,b) such that 
p < q < b and \f(p) — f(q)\ > r. (A similar statement holds for the end point "a. ") 

Proof. Let h G 11(b)- and p e (a, 6). Then p < h < b. Since */(ft) ^ G(0), then Vr e 
3R, | */(ft)| > r and, by reverse *-transform, there exists q G (a, 6) such that p < q < b and 
1/(9)1 > \f(p)\ + r for the \f{p)\ + r e m. Hence, |/(p) - /(<?)| > r. | 

Theorem 10.8. Let /: (a, 6) — > M be uniformly continuous. Then f(b—), f(a+) G IR.. 

Proof. Let ft G n *(a,6) = fJ,(b)~ . Then ft < b. Assume that *f(h) (/ G(0). Since 

h G *(o, b). Then, by *-transform of the conclusion of Theorem 10.7, there exist q G *(a, b) such 
that ft < q < b and | *f(h) — */(<z)l > 1- Since q G n(b)~ , this contradicts Theorem 10.5. Hence, 
*/(ft) G G(0) for each ft G n(b)~ implies that, for a particular ft, st(*/(ft)) = L. Now if fc G 
then uniform continuity implies that */(fc) G A*(st( *f(h)). Thus, f(b—) = L. The result for f(a+) 
is obtained in a similar manner with a similar Theorem 10.7 for a and this completes our proof. | 

Let nonempty E C D and /: E — > M. A function g: D — > M is called an extension of / iff for 

each x G E, g(x) = f(x). 

Theorem 10.9. Let f:(a,b) — > ffi. 6e uniformly continuous on (a,b). Then there exists an 
extension g of f such that g:TR^TRis uniformly continuous. 

Proof. Simply use the last theorem and define g(x) = f(x), for each x G (a, b) and g(x) = f(a+) 
for all x < a, and g(x) = f(b—) for all x > b. It's clear that g is uniformly continuous on TR. | 

Theorem 10.10. Let f: D — > TR be uniformly continuous on each bounded B C D. Then f has 
a unique continuous extension g: c\(D) — > K. 

Proof. A function like M is uniformly continuous on each bounded S C D iff for each 

p, q G *Dn G(0) such that p-?e ^(0), it follows that f{p) - f(q) G fi(0). Since p, g G *DD G(0) iff 
p, q G *flfl * [ - a, a] for some a £ TR. 

Define 5: cl(£>) -> TR as follows: let #(st(a;)) = st( *f(x)) for each x <E *D(~) G(0). This function 
is well defined since if st(y) = st(x), then st(*/(x)) = st( *f(y)) for each x,y G *Dfl G(0) by 
uniform continuity. Further, as we know, cl(D) = {x \ ri(x) n *D ^ 0} = {st(y) | y G *flfl G(0)}. 

Now 5 extends /, for if x £ D, then g(a;) = g(st(x)) = st(f(x)) — f(x). Now it's necessary 
to show that g is continuous for any p G c\(D). Let B = D n [—1 +p,p + 1] and r G IR + . Then 
there exists some w G E + such that w < 1 and for each x, y G £> such that |a; — y| < w it follows 
that — /(y) < r/2 by uniform continuity on bounded subsets of D. By ^-transform, for each 

x,y G *B, and \x - y\ < w, it follows that| *f(x) - *f(y)\ < r/2. Let b G cl(D) and |6-p| < w. Then 
b = st(y), p = st(x) for some x,y £ *B and |x— - y| < w. Consequently | *f(x)— *f(y)\ < r/2 implies 
that \st(*f(x)) — st(*f(y))\ < r/2. Therefore, in the usual manner, we have that \g(b) — g(p)\ < r. 
Hence, g is continuous at p. Finally, g is unique for if ft: c\(D) — > ffi. continuously and extends /, 
then for p G cl(£>), ft(p) = ft(st(x)) = st( *ft(x)) = st( *f(x)) = g(p), where x e *D and p = st(x). 

I need just one more extension result for the next chapter. 

Theorem 10.11. Let f:D — ► K continously and for each p G cl(D) — D, assume that 
lim p f(x) G IR. Tften / has a unique continuous extension g: c\(D) — > IR.. 
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Proof. Obviously, for each p G c\(D), we should let g(p) = lim p /(x) and g: c\(D) — > B is 
unique, since the limits arc unique and extends / for limf,/(x) = /(&), b € D. Let p € cl(D). 
Then /x(p) n *D ^ 0. Let = (—to + g(p),g(p) + w) be an open interval about g(p). Clearly, 
li{g{p)) C *W. and there exists r € 3R+ such that (—r + g(p),g(p) + r) C [— r + <?(p), <7(p) + r] C W. 
Since */[^(p) H *D] C /i(g(p)), it follows that there exists an n > such that for each open interval 
open I s = (— s + p,p + s),0 < s < n about p such that f[I 8 (1 D] C (— r + g(p),g(p) + r) and 
7 S n £> 7^ 0. Let q e I s n (cl(£>). Then since /i(gr) n *D ^ and n *D C */ s n *D, it follows 
that ^ *f[fi(q) n *D] C *(/[7 s (lD])c *( - r + g(p),g(p) + r). From the definition of g, we 
have that ri(g(q)) n *( - r + g(p),g(p) + r) ^ implies that g(q) e *[ — r + g(p),g{p) + r]. Thus 

G W. This yields that g[I s n (cl(D))] C W. Since is an arbitrary open interval about g(p), 
then f i(g(p)) = C\{*(-r + g(p),g(p) + r) \ r e TR+} and fi(p) n *(cl(£>)) C (*J S n («(£>))) imply 
that *g[fJ.(p) n *(cl(D))] C ri(g(p)) and the proof is complete. | 

Corollary 10.12. Lef continuous f:D — > M, D &e bounded and for each p e cl(_D) — D, 
lim p /(x) G K. T/ien / /ias a unique uniformly continuous extension g: cl(D) — > M. 

Proof. Since cl(D) is bounded and closed it is compact. Then the unique continuous extension 
g defined on cl(Z?) by Theorem 10.11 is uniformly continuous. 

Example 10.13. Assume that you have defined for x > 1 the exponential function x r for each 
rational r € Q and that you have shown that it is a strictly increasing function. Then due to the fact 
that Q is dense in B, it follows that on /: Q — > B, where f(x) = x r = sup{/(x) | (x € Q)A(f(x) < r}, 
f is a continuous function. But, c\(Q) = B. Now from the completeness of the real numbers, given 
any irrational r, then, in B, \im r f(x) = sup{/(x) | (x e Q) A (x < r)}. Thus, by Theorem 10.11, 
there is a unique continuous extension g oi f such that for irrational r € B, g(r) = lim r /(x) and 
this is the value of this exponential defined at r. 
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11. BASIC DERIVATIVE CONCEPTS 

I now come to the most striking difference between nonstandard analysis and the standard 
approach. Although the intuitive notions of the calculus are based upon "infinitesimal modeling," 
it was precisely the logical difficulties that occured using the intuitive infinitesimal approach that 
greatly influenced its abandonment. No such difficulties occur for these nonstandard infinitesimals. 
For what follows, please notice that D D D' is the set of all cluster points that are members of D 
and as such n'(p) fl *D ^ and p £ D. These are the members of D that are not isolated points. 
I'll denote DC) D' = D NI . 

Definition 11.1. (The Standard Derivative.) Let p £ D^i 1 p + h £ D, h ^ and 

f:D^>TR. Then the derivative at p (denoted by f'(p)) is finite and has value f'(p) iff lim (/(p + 
h) — f{p))/h = f'(p). The derivative /'(p) = ±oo iff lim (/(/i+p) — f(p))/h = ±oo and has geometric 
applications to the notion of "vertical" points of inflection. 

In all that follows, I'll use, as was done originally, the symbol dx to denote a member of //(0). 
This idea of dx being a special type of number was not carried over by Weierstrass when he refined 
the limit concept. The next theorem follows immediately from our characterizations for the limit 
notion. 

Theorem 11.2. Let f:D — > TR. Then for p £ D NI , f'(p) = s£ 1 [resp. ±oo] iff for each 
dx £ ri'(0) such that p + dx £ *D 

*f(p + dx)-f(p) _ r _ , . 



dx 



£ p{s) [resp. m±]. 



Note that if you let D = [a, b], a < b and f'(a) exists, then /'(a) is but the "right-hand" one- 
sided derivative. Clearly, this definition extends slightly the concept as it appears in the usual basic 
calculus course. The idea for the derivative is that it is a type of rate of change in infinitesimal values. 
In important physical applications, we need to know how infinitesimal rates of change compare with 
ordinary real number rates of change. Let f:D^>TR,y = f(x), x £ D, h £ TR such that x + h £ D. 
Then usually one writes the increment of (for) y at x, and h as Ay = f(x + h) — f(x) = A/(x, h). 
This / generated function (Af)(p,h) is actually a function that determines a hyperfunction by *- 
transform for any q £ *D, k £ *IR such that q + k £ *D. The ""-transform states that * (Af)(q, k) = 
*.f(q + k) - *f(q) - (A */)(<?, fc) - A*f(q,k). Thus, f(p) = s [resp. ±oo] iff A *f(p, dx)/dx £ ll{s) 
[resp. B^] for each dx £ //(0) such that p + dx £ *D. 

Theorem 11.3. Let f: D — > TR. Then f is continuous at p £ D iff A *f(p, dx) £ m(0), for each 
dx £ /x(0) such that p + dx £ *D. 

Theorem 11.4. If f:D — > IR and for p £ DNj,f'(p) £ TR, then f is continuous at p. 
Proof. Assume that f'(p) £ TR- Then for each dx £ p'(0), such that p + dx £ *D 

*f(p + dx)-f(p) 



dx 



Note that there always exists at least one such dx. Hence, *f(p + dx) — f(p) £ /x(0) implies that 
*f\p(p) n *D] C p(f(p)) and the result follows. | 

Our next notion is that of the differential. This is where we return to the time of Newton 
and Leibniz, something that could not be done prior to 1961. I mention that there are different 
approaches to the notion of the differential, especially for multi-variable functions. 
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Definition 11.5. (The Differential.) Let f:D -> TR, /'(p) e TR, dx G //(0), p + da; G *D. 
Then the differential is df = /'(p) dx G p(0). 

Theorem 11.6 Let /: L> -» H. 7/p € D, f'(p) G TR, then f(p) = st(£) /or eac/i dx G //(0) 
smc/i i/mi p + dx £ *D. 

Proof. Immediate. 

We need a better understanding of when the derivative exists and its relation to the differential 
and the infinitesimal increment. For this reason, let's call a function h(p, q) defined on A x B C 
*]R x *K, where /x(0) C -B, an infinitesimal function at p G A iff /i(p, dx) G /u(0), y dx e (J.(0)- 

Theorem 11.7. Let /: _D — > TR, p £ D^j. Then f'(p) £ TR iff there exists a unique t G TR and 
an infinitesimal function, h: {p} x ^(0) — > *B suc/i i/iai /or eac/i da; € /u'(0), w/iere p + dx e *D, ' 

A 7(p, dx) = */(p + dx) - f(p) = (dx)t + (dx)h(p, dx). 

Proof. For the necessity, simply define h(p,dx) — (*f(p + dx) — f(p))/dx — f'(p), dx ^ and 
h(p,0) = 0. Then let t = f(p). It follows that h(p,dx) e /i(0), Vdx e /i(0) and that *f(p + dx) - 
/(p) = (dx)t + (dx)h(p,dx), Vdx e /x'(0). The fact that f is unique follows from the definition of 
the derivative and the disjoint nature of the monads. 

For the sufficiency, let *f(p + dx) — f(p) = (dx)t + (dx)h(p, dx), dx € /x'(0) for each dx £ A*'(0) 
such that p + dx e *D, then ( *f(p + dx) - f(p))/dx - t = h(p, dx) e /x(0) implies that t = /'(p). | 

Note that Theorem 11.7 holds in all cases including the case that / is constant on some interval 
about p. The significance of Theorem 11.7 is that there are collections of infinitesimals called order 
ideals that give a type of measure as to how well the differential approximates the infinitesimal 
increment. For example, the facts are that for a fixed dx > 0, say, and, /'(p) ^ 0, then the set 
o(dx) = {^{dx) | 7 G p(0)} generates an ideal that's a subset of /x(0) with a lot of properties. 
Obviously, dxh(p,dx) G o(dx). One says that df is a first-order approximation for A*f(p,dx) for 
each dx. 

This notion of infinitesimal approximation is exactly how "curves" were viewed in the time of 
Newton and Leibniz. From Theorem 11.7 we have specifically that *f(p+dx) = f(p)+df+dx h(p, dx) 
holds Vdx G m(0). Thus within //(p), the monadic neighborhood about p, the *-line segment g(dx) = 
f(p) + dxf'(p), dx G fi(0) is a first-order approximation for any dx to the *-graph y = *f(p + dx). Of 
course, this can be phrased in terms of *-rangc values. One of the original definitions for a curve was 
that it is an infinite collection of infinitely small line segments. So, once again, we have a rigorous 
formulation for the original intuitive idea. And, yes, under certain circumstances there are "higher 
order" approximations. 

Although it's obvious from limit theory that the sum and product of functions /, g that are 
diffcrentiable at p (i.e. this means that /'(p), g'{p) G TR) are differentiable at p, the following two 
theorems demonstrate how easily the derivative "formula" and the chain rule are obtained. 

Theorem 11.8. Let f, g: D -» TR and /'(p), g'(p) G TR. Then 

(i) ifu = (f)(g), then u'(p) = f(p)g'(p) + f(p)g( P ); 

(ii) if g(p) 7^ and u = f/g, then 

, M g(p)f'(p) - g'(p)f(p) 
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Proof, (i) For dx E //(0) such that p + dx E *D, [ */(p + dx)] [ *g(p + dx)} = [f(p) + dx f'(p) + 
dx h(p,dx)][g(p)+dxg'(p)+dx k(p,dx)} = f(p)g(p) + (f(p)g'(p)+g(p)f'(p))dx+j dx where 7 G /z(0), 
and the result follows. 

(ii) For dx E //(0) such that p + dx E *D, 

*f(p + dx) f(p) A*f(p,dx) + f(p) f(p) 



A *u(p, dx) = 



*g(p + dx) g(p) A *g(p, dx) + g(p) g(p) 
g(p)A*f(p,dx) - f(p)A*g(p,dx) 



g(p)(A*g(p,dx) + g(p)) 

Thus, for dx ^ 0, 

A *uj P , dx) gjp)^^ - /Wj^gg 
dx g(p)(A*g(p,dx) + g(p)) 

The result follows by taking the standard part operator and using the fact that st(A *g(p, dx)) = 0. 
I 

Theorem 11.9. Letf:D^ m, p E D NI , g: f[D] -> B, /(p) e /[£>]*£. Iff'ip), g'(f( P )) G m, 
Tftera /or i/ie composition {gf)(x) = g(f{x)), x E D, (gf)'(p) E B and {gf)'(p) = g'(u)f'(p), u = 

Hp)- 

Proof. Let dx e /z'(0), p + dx E *D. Then + dx) = f(p) + k, k E fi'(0) 

by continuity. Hence, *g(*f(p + dx)) - g(f(p)) = k(g'(f(p)) + k(h g (f(p), k)) by Theo- 
rem 11.7, which also holds if / is a constant in any interval about p. Consequently, 
*g(*f(p + dx)) - g(f(p)) = (*f(p + dx) - f(p))g'(f( P )) + (*f(p + dx) - f(p))h 9 (f(p),k) = 
f\p)g\f(p))dx + g'(f{p))dxhf(p,dx) + f'{p)dxh g (f(p),k)+ 1 h f {p,dx)h g (f{ P ),k), 7 e/i(0)- How- 
ever, g'(f(p))dxhf(p,dx) + f'(p)h g (f(p),k) + jhf(p,dx)h g (f(p),k) E /u(0), for each dx E /x'(0) and 
the result follows. | 

Theorem 11.10. Suppose that f:(a,b) — > IR, a < b, has a derivative for each p E (a, 6) 
and both /, /' are uniformly continuous on (a,b). Then there is an uniformly continuous extension 
g: TR^> IR. that extends f and g' is a uniformly continuous extension f. 

Proof. We know that f'(b—), f'(a+), fib— ), f(a+) exist. The result follows by defining 

(f{x) xEia,b) 
g(x) = \f(b-) + f'(b-)(x-b) x>b . 
[f{a+) + f(a+)(x-a) x<a 

Let p — q E n(0). If p, q E * (a, 6), then the result follows from the hypothesis. If p, q E * [6, +00), 
then *gip) - *giq) = fib-) + /'(6-)(p - b) - fib-) - f'{b-)(q - b) = /'(&-)(p — q) E /i(0) and in 
like manner if p, q E *( — 00,0]. Let p E *(a, b), q E * [6, +00), g w b. Then g 6 implies, since 
p w g, that p w 6 and * 5 (p) = */(p) w /(&-). Now *g(g) = /(6-) + .f (6-)(g - 6) w /(6-), since 
q — bE ^(0). Hence, *5(p) — *g(g) G /i(0). In like manner, for (—00, a] and for g' . The fact that both 
g and g' are uniformly continuous follows from Theorem 10.5 and the proof is complete. | 

The basic calculus I idea of the local (relative) maximum or local minimum point requires in 
the definition quantification over the set of all open intervals about p E D. The interior of a set 
D denoted by int(-D) is the set of all interior points, where by Theorem 8.8, p E D is in int(D) 
iff nip) C *D. Theorem 8.8 eliminates one quantifier from the basic definition. Does a similar 



57 



Nonstandard Analysis Simplified 



elimination happen for a local maximum or local minimum? I'm sure you recall the definition 
relative to the existence of an interval about p that is contained in D. The quantifier eliminated is 
the "there exists." 

Theorem 11.11. Let f:D^>TR. A point p G int(D) determines a local maximum [resp. 
minimum] iff *f(q) < f{p), [resp. > ] Vg G 

Proof. The necessity follows from the definition and Theorem 8.8. 

For the sufficiency, assume that for every r G TR + such that (— r + p,p + r) C D there exists 
q r G (—r+p,p + r) such that f(p) < f(q r )- By ^-transform, we have that if r G /i(0) + , there is some 
q r G (— r +p,p + r) such that f(p) < *f(q r )- However, q r G fj,(p) C *D; a contradiction and this 
completes the proof for the local maximum. The local minimum is similar and the proof is complete. 

I 

Now for the major theorem used to find many of the local maximums or minimums. But, this 
theorem docs not restrict the derivative in the hypothesis to only finite derivatives, although the 
conclusion will do so. 

Theorem 11.12. Let f: D — > IR and p G int(D). If f is differentiable at p and p is a local 
maximum or minimum, then f'{p) = 0. 

Proof. First, let p be a local maximum and f'(p) G IR. Then for dx G /j.(0) + *f(p + dx) < f(p) 
and *f(p — dx) < f(p). Hence, 

*/(p + dx) - f(p) < Q < *f(p - dx) - f(p) 
dx —dx 

The result follows by taking the standard part of this inequality 

I now show that we cannot have that f'(p) = ±oo. Suppose that f'(p) = +oo. Then for each 
dx G /i'(0)+, ( *f(p + dx) - f(p))/dx > 1. This gives that *f(p + dx) - f(p) > dx > 0. Therefore, 
f(p + dx) > f(p)+dx> *f(p + dx) + dx from Theorem 11.11. This implies the contradiction that 
dx < 0. By considering a —dx, it also follows that f'(p) ^ — oo. In similar manner, the result holds 
for the local minimum and the proof is complete. | 

Prior to a generalization of Rollc's theorem, we need the notion of the boundary of a set D. 
First, recall that if D is bounded, then cl(D) is bounded. The boundary of D, dD, is exactly 
what you think it should be, dD = c\(D) n c\(R — D). The boundary of a set is a closed set and 
a nonstandard characteristic is obvious. For our basic sets, continuity at a boundary may be a 
one-sided continuity or even continuity at isolated points. I've mostly been giving definitions and 
even proofs, that are easily generalized to the multi- variable calculus. 

Theorem 11.13. A point p e dD iff fi(p) <jt *D and fi(p) n*fl^{). 

Theorem 11.14. Let f:D — > IR, D be bounded, int(D) 7^ and f is differentiable at each 
p e int(-D). Further, if p G int(D) and f'(p) — ±00, then f is continuous at p. Finally, assume that 
lim a f(x) = L, for each a G dD. Then there exists some q G 'mt(D) such that f'(q) = 0. 

Proof. Clearly, / is continuous on int(D). Assume there does not exist some q G int(D) such that 
f(q) = 0. First, let D = c\(D) and p <£ int(D). Since cl(D) = D = 'mt(D) U dD, then p G dD and 
p G D. This implies that lim p f(x) = L. Now assume, cl(D) ^ D and that p G c\(D) — D, p ^ 'mt(D). 
Then again p G dD. In this case by Corollary 10.12, there exists a unique continuous extension 
g: cl(.D) — > IR.. Now c\(D) is bounded and closed and, hence, is compact. Thus, for / [resp. g] there 
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is an x mi XM G C K^) where / [resp. g] attains its minimum at x m and maximum value at xm ■ 
But, since f(q) ^ for each q £ mk(D), then x m ,XM £ dD. However, since / is continuous on 
int(.D) and g is a continuous extension of / on dD, then L = g(x m ) < g(x) < j(im) = L, for each 
x £ cl(D), implies that g = f is constant on int(D). This implies that f'(q) = for each q £ int(D); 
a contradiction and the result follows. (Note: The possibility that f'(p) = ±00 for some p £ 'mt(D) 
is still valid.) 

Corollary 11.15. (Rolle's theorem) Let a < b, f: (a,b) — > TR be differentiable at each p £ (a, b), 
and f(a+) = f(b—), then there exists some c £ (a,b) such that /'(c) = 0. 

The following has a rather involved hypothesis. All of the requirements appear necessary for this 
generalization of the generalized mean value theorem. (Condition (1) holds if / and g are continuous 
on dD. Further, the conclusion obviously holds under certain conditions for any p £ int(D) where 
f'(p) ± 00 and g'(p) = ±00.) 

Theorem 11.16. Let f:D^> TR, g: D — > TR, where D is bounded and has non-empty interior. 
Let f and g be finitely differentiable at each p £ 'mt(D). 

(1) Let lim a f(x) £ TR, \im a g(x) £ 1R for each a £ dD and 
(lim/Or) - lim f(x))(g(a) - g(b)) = (lung(x) - limg(x))(f(a) - f(b)), Va, b £ 3D. 

a b a b 

Then for each a, b € 9D, then there is some p £ int(D) such that 

f'(p)(g(a) g(bj) = g'(p)(f(a) f(b)). (11.17) 

Proof. Let a,b £ dD and consider F(x) = f(x)(g(a) - g(b)), G{x) = g(x)(f(a) - f(b)). Now 
let h(x) = F(x) - G(x). Then h'(x) = F'(x) - G'{x) £ TR for each x £ int(D). Clearly, for each 
c £ dD, \im c h(x) — lim c f(x)(g(a) — g(b)) — lim c g(x)(f(a) — f(b)) and condition (1) yields that 
lim a h(x) — \imi,h(x) for each a, b £ dD. Thus, by Theorem 11.14, there is some p £ int(D) such 
that h'(p) — F'(p) — G'(p) — and the proof is complete. | 

Corollary 11.18. (Generalized Mean Value.) Let D = [a,b], a ^ b and f,g be finitely 
differentiable on (a, b) and both are continuous at a and b. Then there exists some p £ (a, b) such 
that f'(p)(g(a) - g{b)) = g'(p)(f(a) - f(b)). 

Proof. Condition (1) of Theorem 11.16 holds since /, g are both continuous at a, b. 

Corollary 11.19. Let D be compact and 'mt(D) ^ 0. Let continuous /:£)—» IR be finitely 
differentiable at each p £ int(D). Then, for each a,b,£ dD, there exists a p £ int(D) such that 
f(p)(b-a) = f(b)-f(a). 

I conclude this chapter on basic derivative concepts, by apply Theorem 11.16 to the theory of 
strictly increasing [resp. decreasing] functions. 

Theorem 11.20. If 'mt(D) 7^ 0, /: D — > IR continuously and f'(p) > [resp. < 0] for each 
p £ int(-D), then f is strictly increasing [resp. decreasing] on every [a,b] C D, a =/= b. 

Proof. Let a^b, [a, b] C D. Then [a, b] is compact and (a, b) C int(D). (The vnt(D) is the union 
of the collection of all open sets that are subsets of D.) Hence, the conditions of Corollary 11.19 hold. 
Thus, for any x < y, x, y £ [a, b] there exists some p £ (x, y) such that f(y) — f(x) = f'(p)(y — x) > 
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implies that / is strictly increasing on [a, b] . The proof for decreasing is similar and this complete 
the proof. | 

Corollary 11.21 If a < b and f: [a,b] — ► K continuously and f'{p) — 0, Vp £ (a, b), then f is 
constant on [a, b}. 

Finally, I remark that each of the previous theorems hold under *-transform and yield some 
rather interesting conclusions. Here are two examples with the first a slightly modified application 
of Corollary 11.19. Indeed, the major interest in the next result is when p fa q and this result is 
used in the next chapter. 

Theorem 11.22. Let int(D) ^ and f:D^>TRbe finitely differ entiable at each p £ int(D). 
Then for each p ^ q, p, q £ * (int(£))) such that [p, q] C * (int(D)) there exists some c £ * (p, q) such 
that 

r(c) = 7(p) - 7(g) 

p-q 

Proof. By *-transform. 

Theorem 11.23. (The first L'Hospital Rule.) Assume that f:(a,b) — > K, g:(a,b) — ► M anrf 
/or eac/i c e (a, 6), /'(c), c/'(c) G m and fif'(c) ^ 0. If f(a+) = g(a+) = ana" lim a+ (/'(x)/g'(x)) e 
IR [resp. ± oo], i/ien \m\ a+ (f (x) / g(x)) = L. 

Proof. Let (f'(x)/g'(x)) — > L as x — > a+ and define /(a) = 5(a) = 0. Then / and g are 
continuous at a. Then / and g satisfy the hypotheses of Corollary 11.18. Let p £ ti(a) + an d 
consider the *-transform of Corollary 11.18. Then there exists some t £ n(a) + such that a < t < p 
and L « */'(*)/ V(*) = (7(P) - /(«))/( *$(p) ~ 9(a)) « *f(p)/W) by considering the standard 
part operator and the fact that *t/(p) — t/(a) ^ 0. Thus lim a (/(a)/<7(a)) = £. The proof for ±00 is 
similar and the proof is complete. | 

Obviously, this last result holds for the substitution of x — > 6— for x — > o+. 

Corollary 11.24. Assume that f:(c,b) — > M, g:(c,b) — ► M and /or eac/i x € ((c, 6) — 
{ a })> /'O^)) fl 1 '^) e m aKC ^ #'( c ) 7^ 0) w ^ ere a € (c,o). 7/lim a /(x) = lim a g(x) = and 
lim a (/'(x)/(/(x)) = L [ resp. ±00], fften \im a (f (x) / g(x)) = L [resp. ±00 ]. 

Corollary 11.25. Under the hypotheses, of Theorem 11.23 [resp. Corollary 11.24], f or ea °h 
e,7G p'(0)+ [resp. fj,(0)], it follows that *f(a + e)/*g(a + j)fx *f (a + e) / *g' (a + 7) w L. 
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12. SOME ADVANCED DERIVATIVE CONCEPTS 

Before starting this chapter one small remainder. I will be working with non-trivial continuity 
and the derivative at a point p G D. In all cases, these are defined via non-isolated points. A 
rather simple observation is that p G D is not isolated iff there exists some q G /z(p) fl *D such that 
q 7^ p. Let's consider the ideas of the "higher order" differentials and their relation to "higher order" 
increments, as well as uniform differentiability, and some inverse function theorems. 

Recall the standard definition of the nth-order increment, where it is assumed the function 
is appropriately defined at the indicated domain members. It's defined by the recursive expression 
A«/(p,ft) = A(A"" 1 /(p,/i)), where A°f(p,h) = f(p), A/(p,ft) = f(p + h) - f(p). For example, 
A 2 /(P, h) = A(A/(p, ft)) = f(p + 2ft) - f( P + ft) - f(p + ft) + f(p) = f(p + 2ft) - 2/(p + ft) + f(p). 
Then A 3 /(;P, h) = f(p + 3ft) - 2/(p + 2/i) + /(p + ft) - /(p + 2ft) + 2/(p + ft) - /(p) = /(p + 3ft) - 
3/(p + 2ft) + 3/(p + ft) - /(p). From this we have that for any n£ I 

A"/(p,ft) = f:(-i) fc (™) /(p+ (n- ^ft) = f:(-i) ( ^ fe) (^) /(p+M), 

where ^ = n!/((n - fc)!fc!), < k < n is a "Binomial Coefficient." I now consider the nth 
derivative 

Theorem 12.1. For n G M', 6 G ]R + and suppose that /": [a, a + n&] — ► ffi.. Tften iftere exists 
some te (a, a + n6) sucft tfiaf A"/(a, 6) = f n (t)b n . 

Proof. This is established by induction. For n = 1, Corollary 11.19 yields the result. Let 
g(x,b) = f(x + b)-f(x). Then g n -\x,b) = /"-^z + fc)-/™" 1 ^) e m, for each a; G [a,a+(n-l)b]. 
Thus, there exists some to G (a, a + (n — 1)6) such that A n ~ 1 g(a, b) = g n ~ 1 (to,b)b n ~ 1 . Observe that 
A n - 1 g(a,b) = A n f(a,b). Hence, there exists some t x G (t ,t + b) such that #™ _1 (t ,&) = /™ _1 (*o + 
&)-/" _1 (*o) = / n (*i)&- Consequently, A^-^a.ft) = .a™" 1 ^, &)&™ _1 = f n (h)b n = A n f(a,b), where 
ii G (a, a + n&). The result follows by induction. | 

Corollary 12.2. Let f n : [a,b] — > H. Tften /or eacft dx G Ai(0) + and c G *[a,b), there exists 
some t G (c, c + ndx) sweft iftai A" */(c, etc) = *f n (t)(dx) n . 

Proof. This follows from *-transform and the fact that [c, c + ndx] C * [a, b). 

Observe that Theorem 12.1 and Corollary 12.2 clearly hold for the case that /": [a + nb, a] — > K, 
where b G m(0)~ . Now define the nth order differential at p for y = f(x) by d n y — d n f(p) = 
,f n {p){dx) n = f n (p)dx n . Of course, /°(p) = /(p). 

Theorem 12.3. Let /": [a, 6] ->B, n£ M'. 

(i) // /" is continuous at a, then for each dx G n(0) + an d eac h V G ft( a ) ^ * [ a i 

*r(p)*A n *f( P ,dx)/dx n . 

(ii) for eacft c G (a, 6) and eacft dx G ^(0) + , 

r(c)K*A n *f(c,dx)/dx n . 

Proof, (i) This obviously holds for n = by continuity at a. From Corollary 12.2 and assuming 
that n > 1, we have that for dx G /x(0) + and p G * [a, 6), there is some t G (p,p + ndx) such that 
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A™ *f(p,dx)/dx n = */(*)• Now l ct P e M ) n *K Then p rts a w t and continuity imply that 
/"(p) w A n *f(p,dx)/dx n . 

(ii) This obviously holds for n = 0, 1. For n > 2, in order to show this, I consider only interior 
points and establish this by induction. Notice that it's not required that /" be continuous on (a, b). 
Let w E TR + such that nonempty [c, +c + nw] C [c, c + (n — l)w] C [a, b]. Such a w always exists. 
Define g: [c, c + (n — l)tt;] — > IR, by w) — f(y + w) — f(y), !/ £ [c, c + (n — l)to]. The function 
g{y, w) satisfies the requirements of Theorem 12.1. Hence, there exists some t € (c, c + (n — l)u>) 
such that A n_1 g(c,w) = g n ~ 1 (t)w n ~ 1 . By *-transform, we have that if w = dx E ^(0) + i then there 
exists some ti e (c+(ra-l)dx) such that A™ -1 *g(c,dx) = *g n ~ 1 {t\) dx n ~ x . The definition of w), 
and the fact that for n > 2, in general, g n ~ 1 (y, w) = *f n ~ 1 (y + w) — ,f n ~ 1 (y) yields, by *-transform, 
that A™ */(c, dx) = A™" 1 *g(c, dx) and 

A Y-HC dx) dx^ = ^llll 

Consequently, 

A" 7(e, dx) = T^fti+dx)- *f n -\t 1 ) = 
dx n dx 

T^Hti + dx) - r-jjc) h + dx-c | r-^c) - r^jti) c - h = 

t\ + dx — c dx c—t\ dx 

*f n ~ 1 {c + dxi) - /"^(c) ii + dx - c /"^(c) - *.f l_1 ( c + dx 2 ) c - h ^ 
dx\ dx dx 2 dx 

where dxi = ti + dx — c, dx 2 = ti — c and dxi, dx 2 € /x(0) and the proof is complete. | 

Theorem 12.3 holds for the appropriate negative increments and these results relate directly 
to notion of the nth-order approximation via the nth order ideals. This is because (i) yields that 
*f n (p)dx n w A™ *f(p,dx) and (ii) *f n (c)dx n w A™ *f(c,dx). I have mentioned the first-order ideal 
generated by any dx. For n > 1, the nth-order ideal are generated by the dx™ and is a strict subset 
of the dx n_1 (ra — l)th order ideal. I mentioned that many of the standard theorems have useful 
nonstandard statements. One of these is the nonstandard mean value theorem. For any x,y E *1R, 
the nonstandard interval [x, y] restricted to members of a particular set A C IR is easily defined by 
^-transform. We know that VxVyVz((x E A) A (y E A) A (z E A) — » (z E [x, y] <-> x < z < ?/)). 
Thus, for p,q E *A, p < q, one simply considers the symbol [p, g] for this ""-transform. Such an 
abbreviation occurs in the ""-transform of Corollary 11.19. 

Theorem 12.4. Let f:D — ► B 6?/ finitely differentiable at each p E 'mt(D). For distinct 
p,q E *(int(Z))) such that \p,q] C *(int (£))), there is some c E *{p,q) such that *f'(c) = (*f(p) — 
*f(q))/(p-q). 

Theorem 12.4 is useful in the study of derivatives that are also continuous. Indeed, /:£)—> TR 
is said to be continuously differentiable on D iff /' is continuous on D. 
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Theorem 12.5. Let /: G — > IR be continuously differentiable on G, where G be an nonempty 
open subset of IR. Then for each ceG and p,q G n{c), f'{p) ~ ( */(<? + dx) — *f(q))/dx. 

Proof. Observer that ri(c) C *G. Suppose /' is continuous at c. Let dx G MO) 1 *"- We have that 
for any p G m( c )j [PiP + C *G or [p + dx,p] C *G, respectively. Thus by Theorem 12.4, there 
is some s such that, in either case, *f'(s) = ( *f(p + dx) — *f(p))/dx. By continuity, for any other 
q G n(c), */'(<?) ~ *.f'( s ) ~ *f(p) an d this completes the proof. | 

As is very well know the derivative can exist but not be continuous. Determining when the 
derivative is continuous is a substantial problem. With this in mind, I show how a slight change in 
the conclusion of Theorem 11.2 implies the continuity of the derivative. 

Definition 12.6 (Uniformly Differentiable.) Let /:£)—» IR, c G D NI and *f'(c) G *IR. 
Then / is said to be uniformly differentiable at c iff for each distinct x, y g p(p) H *D 

x-y 



By now you should have no difficulty translating Definition 12.6 into standard terms, where 
p G Djyi,f'(p) G IR. Then such a translating gives that for any w > 0, there's an open interval 
(— r + p,p + r) about p G D^i such that whenever distinct x,y G (— r + p,p + r) D D, then 
— (/( x ) — f(u))/( x ~ y)\ < w as the equivalent statement. The uniform part is the requirement 
that x, y be somewhat unrestricted within an interval about p. 

Theorem 12.7. If for nonempty open G C IR, /: G — > IR, p g G and /' is continuous at p. 
Then f is uniformly differentiable at p. 

Proof. This come from Theorem 12.5 by letting x — y = dx. | 

Example 12.8 Uniform differentiability was first investigated rather recently (Bahrens, 1972). 
It's major contribution is that a major theorem dealing with inverses, which was previously es- 
tablished for continuously differentiable functions, holds true for uniformly differentiable func- 
tions. There are many functions that are uniformly differentiable at a point but not differentiable 
throughout any open interval about that point and, hence, not continously differentiable at that 
point. As an example, consider a function constructed as follows on [—1,1]. Consider generat- 
ing a function / in the following manner. For each n > generate a collection of points by 
the recursion starting with x = ±1, /(±l/n) = 1, n = 1. Then, for each x = ±l/(n + 1), let 
/(±l/(n+ 1)) = f(±l/n) — l/(n 2 (n+ 1)). Then consider line segments, connecting successive pairs 
of these points as end points, as generating the function / defined on [—1,1]. The slope of each of 
these line segments n > from (±l/(n+ 1), /(±l/(n + 1)) to (±l/n, /(±l/n)) is 1/n. It follows 
that /'(0) = and that / is uniformly differentiable at p = 0. However, any interval (— r, r), r G M + 
about p = contains a point where /' does not exist. 

I mention that for the real numbers if I is an open set such that real p G /, then there always 
exists some r G IR + such that p g (— r + p,p + r) = I p C I. The I p is an open interval about p. 

Theorem 12.9. Let f:D^>~5R, p g D^i- If I is an open interval about p, and f is uniformly 
differentiable for each c G I H Dni, then f is continuous at p. 

Proof. Observer that since p G D^i iff ti'(p) *D ^ and p £ D' Thus, there are a lot of these 
open intervals I about p such that /' n D ^ 0. We first have that /'(c) G IR, c e Dnj. Let r G IR + . 
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Then there exists arae K + such that for each h £ IR such that 0< \h\ < w,c + h £ D 

f(c + h)-f(c) 



/'(c) - 



h 



< r. 



Let q £ fJ,(p) n * / n *D and any dx £ fi(0) such that q + dx £ fj,(p) n ( * J H *D). Hence, considering 
*-transform for arbitrary r £ 1 + , it follows that *f'(q) - 7(g+d ^~ " /(g) < r. Thus, w 

^ q+d dl~ ■ Uniformly differentiable yields that f'(p) ~ */'(<z)- The point q was an arbitrary 
member of /Lt(p) n */n *£>. Since /z(p) C */, this yields that */V(p) n *D] C n{f'(p)) and the proof 
is complete. | 

Corollary 12.10. If nonempty open G C D and f:D—* IR. is uniformly differentiable on G 
(i.e. at each c £ G), then f is continuous on G. 

Corollary 12.11. Let f:D^~5R, p £ Dni and f is uniformly differentiable at p. If for each 
q £ ri(p) n *D and dx £ ri'(Q) such that q + dx £ *D, *f(x) w ( *f(x + dx) - *f(x))/dx, then f is 
continuous at p. 

Although uniform differentiability at a point does not imply that the derivative is continuous 
at that point, what does happen is that it forces / to be continuous on an entire non-trivial set that 
contains p. 

Theorem 12.12. Suppose that f: D — > IR is uniformly differentiable at p £ int(D). Then there 
exists some open interval I C D about p such that f is continuous on I. 

Proof. Since p £ int(D), there are many open intervals / about p such that I £ D. Let 
L = \f'(p)\ + 1. Assume that for each open interval I £ D about p there is some y £ I such 
that / is not continuous at y. Since fi(p) C *D, then by ^-transform, each microinterval 7 7 = 
(—7 + p,p + 7), 76 A i (0) + contains some y such that */ is not *-continuous at y. This translates 
to say that there is some r £ *M such that for all w £ *IR + such that \x — y\ < w and */ is defined 
at x, then \ f *f(x) — *f(y)\ > r. Hence, for any such r, there is some x £ fi(0) such that x ^ y and 
r/L >\x- y\. Thus, | *f{x) - *f{y)\ > L\x — | > 0. Hence, \(*f(x) - *f(y))/(x -y)\> \f'(p)\ + 1. 
This contradicts uniform differentiability at p and the result follows. | 

Corollary 12.13. Let nonempty open G £ TR and f: G — > IR be uniformly differentiable at 
p £ G. Then there exists an open interval I such that p £ I, and f is continuous on I. 

I'll shortly use these ideas on uniform differentiability for an investigation of how the inverse 
function for an appropriate differentiable function behaves. 

Definition 12.14. (Darboux Property.) A function /: D — > IR is said to have the Darboux 
property of D iff for each a,b £ D such that [a, b] C D, either [/(a), /(&)] C /[[a, b}] or [/(&), /(a)] C 
[a, b]. Also recall that a function / on [a, b] is one-to-one or an injection iff for each distinct 
x,y £ [a,b] f(x)^f(y). 

Theorem 12.15. If f: [a,b] — > IR, a < b, is an injection and Darboux, then f is either strictly 
increasing or strictly decreasing. Further, f[[a, b}] is a non-trivial closed interval with end points 
f(a) and f(b). 

Proof. Since /(a) ^ f(b), I can simply assume that f(a) < f(b). Let x,y,z £ [a, b], x < 
z < y, f{x) < f(y), but f(x) ft f(z) or f(z) £ f(y). One-to-one implies that f(x) > f(z) or 
f(z) > f(y). If f(z) > f(y) > f(x), then the Darboux property implies that there exists some w 
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such that x < w < z and f(w) = f(y). Since w ^ y this contradicts one-to-one. In like manner, 
for f(x) ^ f(z). Therefore, f(x) < f(z) < f(y). Now let c, d € [a, b] such that a < c < d < b. 
Then f(a) < /(c) < f(d) and /(c) < f(d) < f(b). Thus, in this case, / is strictly increasing and 
/[[a, b]) = [/(a), /(&)]. A similar argument shows that if /(£>) < /(a), then / is strictly decreasing. 
The Darboux property now implies that f[[a, b]] is a nontrivial closed interval. | 

Corollary 12.16. Let continuous f: [a,b] — > B. Tften / is an injection iff / is either strictly 
increasing or decreasing on [a, b] . 

I now show that there are discontinuous functions that have the Darboux property. 

Theorem 12.17. If f:D —* TR is finitely differ entiable on D, then f has the Darboux property. 

Proof. All we need to do is to consider what happens if a, b € D and [a, b] C D and /'(a) < f'(b). 
Suppose that /'(a) < k < f'(b). Then the function g: [a, b] — > H defined by g(x) = f(x) — kx is 
finitely differcntiable on [a, b}. Hence g is continuous. Thus, there is some c € [a,b] such that 
<?(c) < g(x), Vi G [a, b], (i.e. (/(c) is the minimum value of g on [a,b]. Since </(x) = /'(x) — k, 
then #'(&) = f'(b) - k > 0. In like manner, 3 '(a) = f'(a) - fc < 0. Let da; e ^(0)". Then (*g(b + 
dx) — g(b))/dx > implies that *g(b + dx) < g(b). Consequently, there is some x € [a, b], by reverse 
*-transform, such that g(x) < g(b). In like manner, there exists some y € [a, 6] such that g(y) < g(a). 
Hence, a, b 7^ c. Therefore, (?'(c) = implies that /'(c) = fc and the proof is complete. | 

I point out that there are examples of functions finitely differcntiable on [0, 1] but with un- 
countable many discontinuities (Burrill and Knudsen, 1969, p. 191.) Finally, in this chapter, I'll 
investigate various types of inverse function theorems. Let /: (a, b) — > E be continuous. Then / 
is Darboux on (a, b). Thus, / defined on [c,d] C (a, b) is an injection iff / is strictly increasing or 
decreasing on [c,d\. In this case, / has an inverse function / _1 such that / _1 : /[[c, d]] — > [c,d] and 
/ _1 is an injection onto [c, d] which is also strictly monotone in the same sense. 

Theorem 12.18. Let the injection f:D^TRbe continuous on D and D is compact. Then the 
inverse function / _1 : f[D] — > D is continuous on f[D]. 

Proof. Let f(p) e f[D]. We know from one-to-one that */ is one-to-one and that for eachp e D, 
*f[f-i(p)n *D\ = *f[fi(p)]D *(f[D]). However, for our purposes consider from continuity that */[Mf , ) l ~ l 
*D] C m(/(p)) n * {f[D]). Let q G ri(f(pj) n * (/[£>]). Then there is some s e *D such that *f(s) = q. 
From compactness, there is a pi e -D, such that s € an d ^ *^] C ^ * (/[f]) 

implies that g e m(/(p)) ^ m(/(pi))- Thus /(pi) = /(p)- From one-to-one, this gives that p = pi. 
Consequently, g e implies that g e */[m(p) n *°]- Hence, */[m(p) n *°] = n *(/[-°])- 

One-to-one gives that /i(p) n *Z? = */ _1 [m(/(p)) H *(/[£>]). Thus, / _1 is continuous at /(p). | 

Theorem 12.19. Lef / &e an interval with more than one point. If the injection f: I — ► IR is 
continuous I, then / _1 : /[/] — > J is continuous on /[/]. 

Proof. For any pe J, pe [a, 6] C J, for some a, 6 such that a ^ b. 

Corollary 12.20. Let non-empty open Gel and i/ie injection f:G^>TRis continuous on G. 
Then f ■ f[G] — > G is continuous on f[G]. 

Theorem 12.21. For a < b, let the injection /: [a, 6] — > ffi. 6e continuous on [a, 6]. // non-zero 
f'( P ) em, pe [a, 6], then (/-7(/(p)) = l//'(p). 

Proof. Since / is continuous and one-to-one on [a,b], then / is strictly monotone. Assume / is 
strictly increasing. Then f[[a, b}} = [f(a),f(b)], f(a) < f(b). The injection Z" 1 : [f(a)J{b)\ -> [a, 6] 



65 



Nonstandard Analysis Simplified 



is continuous and strictly increasing on [/(a), /(£>)]. Let f(p) G [/(a), /(£>)]■ Clearly /(p) is a cluster 
point. Let dx g (J,(Q)' and /(p) + dx G n(f(p) n *[/(a), /(&)] and consider ft = *f~ 1 (f(p) + dx) - 
/ _1 (/(p))- Since / _1 is continuous and one-to-one, then ft g //(0). Further, one-to-one also implies 
that */(ft + p) = /(p) + dx. Now 

v „ V(fe+P)-/(P) = ^ 

7 w ~ ft ft 

and /'(p) 7^ imply, by considering properties of the standard part operator, that 

i M ft = *.r 1 (f(p)-dx)-f-\f(p)) 

f'(p) dx dx 

This completes the proof. | 

Corollary 12.22. Let non-empty open Gc 1 and the injection f:G-^TRbe continuous on 
G. Ifforpe G, + f{p) g m, then (/"^'(/(p)) = l//'(p). 

Theorem 12.23. Let strictly monotone f:D—fTRbe continuous on compact D and, forp g D. 
^ /'(p) g m. Then (/"^'(/(p)) = l//'(p). 

Proof. Assume that / is strictly increasing and, thus, one-to-one. The f^ 1 : f[D] — > D is 
continuous and strictly increasing on compact f[D]. Since p g D^i, then /x(p) n (*D — D) ^ 0. For, 
Q g /u(p) n ( *D - D), continuity and strictly increasing imply that *f(q) g /x(/(p)) n [ * (/[£>])- /[£>]]. 
Thus, /(p) is a cluster point. The proof now follows as in Theorem 12.21. | 

Example 12.8 can be modified to obtain a strictly increasing function of [—1, 1] such that 
/'(0) ^ 0. By Theorem 12.21, (/ _1 )'(/(0)) = l/.f(0) but it is not continuous since /'(p) does not 
exist on any open interval that contains 0. It is, however, uniformly differentiable. This is why the 
next result is a recent improvement over all other previous results relative to differentiable inverses. 

Theorem 12.24. Let /: D — > M, where D is compact, f is strictly monotone on D and at 
p g int(-D), 7^ /'(p) is uniformly differentiable. Then / _1 is uniformly differentiable at f(p) and 

(f-'Yifip)) = V/'(p). 

Proof. Assume that / is strictly increasing on D. Then / _1 exists for f[D]. Uniformly differen- 
tiable implies that / is continuous on some I C D, where I is an open interval about p. Thus, there 
exists [a, 6], (a^ b) such that p g [a, b] C J C D. Since [a, b] is compact, then the result that / _1 is 
differentiable at /(p) and that (/ _1 )'(p) = l/.f(p) follows from Theorem 12.23. 

Assume that dy g ju'(0), y g /i(/(p)) C */[*£>] such that y + dy E *f[*D] and 

■i W/ u^ 7 _1 (y + <*»)- 7 _1 (v) 



(r7(/(p)) 96 



dy 

Then, from properties of the "st" operator, 

f M & d JL 

7 [P> ^ *f-Hv + dy)- *f- 1 (y)' 

Observe that y = *f(q) for some unique q g n(p) n *D. Further, continuity of / _1 at f(p) and 
increasing imply that ^ *f~ 1 (y + dy) - *f~ 1 (y) = ft G //(0). Now *f~ 1 (y + dy) = q + h g *£> and, 
of course, q + h E fi(p). Thus, y + dy = *f(q + ft) yields = + ft) - */(<?)• Therefore, 
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a contradiction of uniformly differcntiablc of / at p. Thus, f'(p) is uniformly diffcrcntiablc at /(p) 
and the proof is complete. | 

Finally, I apply some of these previously results to establish a major classical theorem on inverse 
functions. THE inverse function theorem. 

Theorem 12.25. Let G be a non-empty open subset H. Let the /': G — ► IR be continuous 
on G. Then at any p G G, where /'(p) ^ 0, there exists an open intervals L and U such that 
pe/cG, /[/] = U C f[G] and /"* exists U, (J- 1 )' is continuous on U, and (/"^'(p) = l//'(p) 
/or eac/i p G [A 

Proof. Let p £ G and /'(p) ^ 0. I show first that f(p) G int(/[G]). Let p G (a, 6) C G. 
Since /'(p) 7^ and /'(p) is continuous on (a, 6) there exists some open interval Lq such that 
p G /o C (a,o) and /'(x) > for each x G /o or /'(#) < for each x G /o- Thus, / is strictly 
monotone and continuous on L a . So, / is one-to-one on L a . Further, there is a closed non-trivial 
interval [c, d] and the open (c, d) such that p G (c, rf) C [c, d] C /o • Consider that case where 
/(c) < /(<i). Since [c,d] is compact, it follows that for [/(c), /(d)], / _1 : [/(c), /(d)] — > [c, d] is 
continuous on [/(c), /(d)] by Theorem 12.18 and , hence, continuous on (/(c), /(d)). Now Theorem 
12.15 implies that f((c,d)) = (/(c), /(d)) C int(/[G]). Theorem 12.7 yields that / is uniformly 
differcntiable on (c, d). Theorem 12.24 gives that / _1 is uniformly diffcrcntiablc on (/(c), /(d)) 
and (/ _1 )'(a;) = l//'(y),/(y) = x for each x G (/(c), /(d)). Theorem 12.9 yields that (/- 1 )' is 
continuous on (/(c), /(d)). In like manner for the case /(c) > /(d) and the proof is complete. | 
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13. RIEMANN INTEGRATION 

Since I'm using an arbitrary free ultrafilter generated nonstandard model for analysis and the 
somewhat weak structure M. one should not expect that M models all aspects needed for Riemann 
integration. For this reason, a few results use standard proofs. One the other hand, I'll obtain many 
results relative to the Riemann integral by means of proofs using nonstandard techniques. Unless 
otherwise stated, all functions, such as /, discussed in the chapter will be bounded 
and, for a < b, map [a, b] into IR. This is a generalization for a basic analysis course of the usual 
Calculus I requirement that F is a (bounded) piecewise continuous function defined on [0,6]. 

One of the problems with Riemann integration is that it may be stated in terms of any partition 
of [a, b]. Nonstandard analysis allows us to eliminate this "any partition" notion. For our purposes 
a partition of [a,b] is but a finite collection of points {a = x < ■ ■ ■ < x n < x n+1 = b}. There 
are different ways to generate "simple partitions." The one now introduced is considered as a very 
simple type of partition of [a, b] and allows any positive infinitesimals to generator a nonstandard 
partition. 

Definition 13.1. (The Simple Partition.) In this chapter, let Ax always denote a positive 
real number. For Ax, there exists a largest natural number "n" such that a+n(Ax) < b. Define x n = 
a + nAx < b. Then there is a unique partition of [a, b], P(Ax) = {a = x < • • • < x n < x n+ i = b} 
such that for each [xi, Xi+i], 2^+1 — Xi = Ax, i = 0, . . . ,i = n— 1 and x n+ \ — x n = b — (a + nx) < Ax 
due to the statement dealing with n being the "largest n" such that < b — (a + nAx). 

It's possible that x n — x n+1 . Indeed, let Ax = b — a. Then n = 1 and x\ = x 2 = b. The 
existence of this unique largest n can be expressed in our formal language as follows 

Vx((x G ]R + ) -> 3y((y e M) A (a + yx < b) A Vz((z e M)(a + zx < b) -» (z < y)))). 

Further, for this unique n, there's a function from [0, n+ 1] — > [a, b] that generates all of the partition 
points, where x$ = a and x n +i = b. It's defined by letting Xk = (a+kAx), < k < n, x n+ i = b. Such 
functions are called partial sequences. For every Ax, there exists such a partial sequence. This 
allows one to define what is termed as a "fine partition" for each positive infinitesimal. For such Ax, 
the partial sequence has a hyperfinite domain since it's not difficult to show that if Ax = 76 m(0) + ; 
then the unique n is a member of BJoo. For this reason, such partial sequences generated by positive 
infinitesimals are often called hyperfinite sequences. 

Definition 13.2. (A Fine Partition.) Let dx,dy 7 dz etc. denote members of ri(0) + ■ For any 
dx, there exists a unique A e INqo such that a + Adx < b and V k € * ^IN if a + k dx < b, then k < A. 
The hyperfinite sequence S: [0, A] — > * [a, b] such that Xk = (a + k dx), k € [0, A] and xa+i = b yields 
a fine partition P(dx) of (for) [a, b], where Xk+i — Xfe — dx, < k < A, b = xa+i — xa < dx. 

Since I only consider bounded functions, then this investigation is based upon the completeness 
of the real numbers. So, as usual, for each closed interval [x,,x, + i], let rrii = inf{/(x) | Xj < x < 
x i+ i} and Mi = sup{/(x) | Xi < x < x i+1 }. (Recall that "inf" is the greatest lower bound of a set, 
and "sup" is the least upper bound.) Of course, vrti < Mj. 
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Definition 13.3. (Upper and Lower Sums). For each Ax, and the bounded function /, 
two operators are defined as follows: 



/n-l 



L(f, Ax) = Y rriiAx + m n (b - x n ) 



\ o / 

U(f, Ax) = ^ M * AX ^ + M ^ b - 

For the fixed function /, the lower sum L(f, ■): B + — > B, and the upper sum U(f, •): IR. + — > B 
have the usual nonstandard extensions. 

Definition 13.4. (Hyperfinite sums.) For each dx, *L(f,dx) and *U(f,dx) are called the 
lower hyperfinite sum and upper hyperfinite sum, respectively. I've used a slight abbreviation 
in this notation, where the / in the notation is actually */■ 

Theorem 13.5. For each dx and any f , 

(i) *L(f,dx) < *U(f,dx), 

(ii) *L(f,dx), *U(f,dx) e G(0). 

Proof. Since / is bounded on [a, b], then there exist n, m e TR such that m < f(x) < M, Vi e 
[a, b}. Consider any Ax. Then mAx < f(x)Ax < MAx yields that 

/ /n—l \ \ /n— 1 \ 



J2Ax )+(b - x n ) < I ^ miAx + m„(6 - x„) < 



M t Axj + M n {b - x n ) < M ^ ^ AxJ + (6 - x„) J , 

since these are finite summations. Hence, m(b — a) < Ax) < [/(/, Ax) < M(b — a). Then the 
sentence 

Vx((x e m+) -» m(6 - a) < x) < [/(/, x) < M(b - a)) 

holds in A4; and, hence, in *M. By *-transform, the result follows. | 

In the theory of Riemann integration, refinements of a partition play a significant role. They also 
present significant intuitive problems, as well. The next result is similar to a refinement proposition 
for Riemann sums. 

Theorem 13.6. For every Ax and for every p G *3N' = *M - {0}, 

*L(f,Ax)< *L(f,Ax/p)< *U(f,Ax/p)< *U(.f,A). 

Proof. It follows from the definition, that for each Ax and corresponding partition P(Ax) 
generated by Ax that P(Ax) C P(Ax/n),n € M'. Now I need to consider a standard argument 
at this point and direct you to Theorem 10.1 in Burrill and Knudsen (1969, p. 199), where it is 
established that, for our case, 

L(f, Ax) < L(f,Ax/n) < U(f,Ax/n) < U(f,Ax), 

for n e I'. The result follows by ^-transform. | 
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I point out that Theorem 13.5 shows that for each positive infinitesimal dx, since *L(f, dx) < 
*U(f,dx), then st(*L(f,dy)) < st(*U(f,dy)). Also, I won't continue to mentioned the fact that 
< *U(f,p) - *L(f,p) for cachp G *TR+. 

Definition 13.7. (Integrable Functions.) The function / is (simply) integrable iff there 
is some dx such that 

st(*L(f,dx)) = st(* U(f,dx)) iff 

*U(f,dx)- *L(f,dx) e v(0). 

If / is integrable, then we denote st(*L(f,dx)) = J a f dx as the integral, where it's understood 
that this is the simple integral. 

I also use the notation f dx G B to indicate that / is integrable on [a, b]. At the moment, it 
appears that the value of the integral might depend upon the dx chosen. I'll show, later, that this 
is not the case. Of course, it's clear from above that if f dx G M, then f (dx/n) G M, n G *W'. 
What functions are integrable? 

Theorem 13.8. If f is monotone on [a,b], then J a f dx G M. 

Proof. Assume that / is increasing. For an Ax generated partition, Mi — f(x i+ i), rrii — 
f(xi), i = 0, ... ,n. For n G M', let Ax = (b - a)/n. Then 

U(f(Ax) - L(f, Ax) = ((b - a)/n)(f(b) - /(a)). 

By *-transform, for each A € BJoo, where dx = (b — a) /A, 

*U(f,dx) - *L(f,dx) = dx(*f(b) - 7(a)) G fi(0). 

and the result follows. | 

What is needed is a general standard characterization for integrability in our sense. 

Theorem 13.9. The function f is integrable on [a,b], for some dx G /i(0) + , iff, for each 
r G IR + ; there is some Ax G such that 

U (/, Ax) — L(f, Ax) < r. 

Proof. Assume that f dx G IR and r G m+. Then dx G fi(0) + and *U(f, dx) - *L(f, dx) < r. 
The necessity follows by reverse *-transform. 

For the sufficiency, assume that r G IR. + and that there exists some Ax such that U(f, Ax) — 
L(f,Ax) < r. If n G M', then it also follows that U(f,Ax/n) - L(f,Ax/n) < r by Theorem 13.6 
restricted to M'. However, there always exists an n G M' such that < Ax/n < r. Consequently, the 
sentence 

Vx((x G m+) -> 3y((y G M+) A (y < x) A (U(f, y) - L(f, y) < x))) 

holds in AA; and, hence, in *AA. Letting 7 G n(0) + , there exists some dx such that *U(f,dx) — 
*L(f,dx) < 7. The result follows from Definition 13.7 since *U(f,dx) - *L(f,dx) G /x(0). | 

Corollary 13.10. TTie function f is integrable on [a,b] iff for each 7 G n(0)' there exists some 
dx such that *U(f,dx) — *L(f,dx) < 7. 
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I'll denote Riemann integration by the symbol R f dx. This form of integration is defined 
in terms of what appears to be a more general form of partitioning of [a, b]. This may be why 
many students, when they first encounter the complete definition for Riemann integration, find it 
somewhat difficult to comprehend. I'll show later that the simple integral as defined here for a rather 
simple type of partitioning is equivalent to the Riemann integral. There are two major but equivalent 
definitions for the Riemann integral. (A) You consider the same idea of the lower and upper sums, 
but you do not restrict the partitions. You define these sums in exactly the same way but for 
any partition P' . But, then you must also do the following. You consider the numbers R(f) = 
sup{L(/,P') | P' any partition of [a, b}} and R(f) = mi{L(f,P') \ P' any partition of [a, b}}. If 
R(f) — R(f), then this value is the Riemann integral of /. For the structure I'm working with, 
the collection of all such partitions is not part of the structure. So, this is why I need to use 
some results obtained by standard means. Then there is the more familiar equivalent definition. 
(B) You consider a general partition P' = {a = xo < ■ ■ ■ < x n — b}, n > 0, where one defines 
the mcsh(P) = maxjAx^ | (Axi = Xi — Xi-i) A (i = 0, ...,n)}. Then you consider any finite 
collection qi e [xi-\,Xi\ and evaluate the function at these points and consider the Riemann sum 
J2i f{li){ x i ~ x i-i)- Then a number R J a is the Riemann integral iff for each r e TR + , there exists 
a w € TR + such that for every partition P' such that mesh(P') < w and every % e [xi-\,Xi], 

n „b 

^2f(Qi)(xi - Xi-i) - R / j dx 

■y J a 



< r. 



Although the notation contains the symbol dx, infinitesimals are not mentioned in definitions (A) 
and (B). For bounded /, I use Definition (A) for the Riemann integral since the only difference is in 
the collection of partitions needed. For dx, the partition notation P(dx) is an abbreviation for the 
fine partition that can be explicitly defined for the dx. 

Theorem 13.11. // J* f dx e m, then J* f dx = R f dx. 

Proof. Let U(f, P'), L(f, P') be the upper and lower Riemann sums for a any general partition 
P'. Here is where I need a standard result about Riemann integration. It states that R f dx E TR 
iff, for each r <E IR + , there exists a general partition P' such that U(f,P') — L(f,P') < r. Also, 
L(.f,P r ) < Rj'afdx < U(f,P') (Burrill and Knudson, 1969, p. 202). But, a simple partition 
P(Ax) is a general partition. Indeed, for our partitions L(f,P') = L(f,Ax), U(f,P') = U(f,Ax). 
Consequently, Theorem 13.9 yields that R f dx e TR. And, further, by ^-transform, that for the *- 
Riemann partition P(dx), st( * U(f, dx)) = Rf^ f dx = f dx = st( *L(f, dx)) and this completes 
the proof. | 

Corollary 13.12. // J^fdx, f*fdy € TR, then f dx = f dy. 
Let's easily establish some of the basic integral properties. 

Theorem 13.13. Let bounded f and g be integrable on [a,b] for dx. 

(i) For each c G TR, f + g, cf are integrable on [a,b] and / (/ + g) dx — J a f dx + 
g dx, cf dx = c f dx, dx = b — a. 

(ii) If f(x) < g{x), yx e [a,b] then f dx < J^gdx. 

(iii) Ifm<f(x) < M, Vxe [a, b], then m(b - a) < f dx < M(b - a). 
Proof. These arc all established by simple observations about finite sums. 
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(i) Observe that 

L(f, Ax) + L(g, Ax) < L(f + g, Ax) < U(f + g, Ax) < U(f, Ax) + U(g, Ax). 

This result follows by *-transform using the dx and the standard part operator. 

Since L(cf,Ax) — cL(f,Ax) < U(cf,Ax) = cU(fAx), then result follows by using the stan- 
dard part operator for the given dx. 

Now observe that L(l, Ax) = U(\, Ax) = b — a. Thus, for the given dx, dx = b — a. 

(ii) Clearly for each Ax, L(f,Ax) < L(g,Ax) implies that st( *L(f,dx)) = f^fdx < 
st( * U(g, dx)) = g dx for the given dx. 

(iii) Simply apply (i) and (ii) and the proof is complete. | 

Monotone bounded functions need not be continuous, but they are integrable. The continuous 
functions should be integrable or this integral would not be very useful. There are very short 
nonstandard proofs of the following result but they require a more comprehensive structure than 
I'm using. The nonstandard proof that establishes the next result is longer than the standard proof 
since I've defined integration nonstandardly. So, I'll give the usual standard proof that depends 
upon the standard characterization of Theorem 13.9. 

Theorem 13.14. /// is continuous on [a,b], then f^fdx G TR for some dx G ^(0) + . 

Proof. Let r£B and let c = r/(b — a). From uniform continuity, there isawg E + such that 
for any x, y G [a,b] such that \x — y\ < w, then \ f(x) — f(y)\ < c. Consider any simple partition 
P(Ax) for [a, b]. Let [xi,Xi+i] be one of the subdivisions. Then there is x',y' € [xj,Xj+i] such 
that m 4 - fix',), M t = f(y'J. Hence, U(f,Ax)) - L(f,Ax) = YT ~\fWd - f(<))^ + {fiv'n) ~ 
f(x' n ))(b — x n ) < c(b — a) = r. The result follows from Theorem 13.9 and the proof is complete. | 

I mentioned previously that integration as here defined is equivalent to Ricmann integration. 
It's time to establish this. But, due to the weak structure I'm using, I need one more standard result 
about general partitions. 

Theorem 13.15. For each r e ffi+, there exists some w € M + such that for all partitions P' , 
where mesh(P') < w, 

< R(f) - L(f, P') < r, < U(f, P 1 ) - R(f) < r. 

Proof. This is establish in a portion of proof of Theorem 10.28 in Burrill and Knudscn (1969, 
p. 223.) 

Theorem 13.16. For each dx, 

R(f) - *L(f,dx) e m(0) + , *U(f,dx)-R(f) g m(0) + - 

Proof. Assume that there exists some dx such that R(f ) — *L(f, dx) ^ ^(0) + - Since < R(f) — 
*L(f, dx), then there exists some r G IR.+ such that R(f ) — *L(f, dx) > r. Now let arbitrary w G 3R + . 
Then, < dx < w. But, Theorem 13.15 holds for our simple partitions where mesh(P(Ax)) = 
Ax. Hence, there exists some wi G IR. + such that for each P(Ax), when Ax < u>\, then R(f) — 
L(f,P(Ax)) = R(f) - L(f,Ax) < r. By *-transform of this conclusion, it follows that R(f) — 
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L(f,dx) < r since < dx < w\; a contradiction. Hence, for each dx, R(.f) — *L(f,dx) e /i(0) + . In 
like manner, it follows that *U(f, dx) — R(f) G A i (0) + and the proof is complete. | 

Theorem 13.17. If R J a f dx e TR, then R f dx = f dx, for each dx. 

Proof. Let R f dx e TR. Consider arbitrary dx. Then R(f) — R(f) = R J f dx and, from 
Theorem 13.16, R f a f dx - *L(f, dx) e //(0), * U(f, dx) - R J* f dx e /z(0) yield the result. | 

So, all we need in order to study Riemann integration are the simple partitions and the simple 
integral that's independent from the actual dx that's used. This is a significant simplification. 
Further, it's obvious that if one wants to generalize the Riemann integral to other functions defined 
on [a, b] it's clear that either the partitions must be of a different type than the simple partition or the 
integral must be dependent upon the dx used. The major generalization is called the generalized 
integral and how under a definition similar to (B) such a generalized integral is equivalent to the 
Lebesgue integral. Anyone who studies the Lebesgue integral from the viewpoint of measure theory 
knows the subject can be difficult. It's a remarkable fact that the Lebesgue integral can be viewed as 
a Ricmann-stylcd integral under definition (B) for specially selected partitions and specially selected 
values for the function. From the nonstandard viewpoint, one major difference is that the Lebesgue 
integral is not infinitesimal independent. The infinitesimals needed are generated by objects called 
L-microgauges (Herrmann, 1993, p. 217.) But, all of this is well beyond the material in this book. 

It's a useful fact that any dx can be used to obtain the integral. This allows many standard 
results to be established easily. Recall that by definition f dx = 0, f dx = — J a f dx. 

Theorem 13.18. (i) Suppose that f a fdx € TR and c G [a, b]. Then f dx, f dx <E TR and 

Sa f dx = Ja f dx + Sc f dx - 

(ii) Ifc € [a, b] and f dx, f dx € TR, then f dx E TR and f dx = f dx + f dx. 

Proof, (i) Clearly, the result holds for c = a, c = b. So, let c € (a,b) and Ax = (c — a)/n,n € M'. 
Then all the points in the simple partition created by Ax for [a, c] are points in the Ax generated 
simple partitions for [c, b] and [a, b]. Hence, it follows that 

L(f, Ax, [a, b}) = L(f, Ax, [a, c]) + L(f, Ax, [c, b}) < 

U(f, Ax, [a, c]) + U(f, Ax, [c, b]) = U(f, Ax, [a, b}). 

Let A e Moo and dx = (c — a)/ A. By *-transform and the standard part operator and the fact that 
fa f dx € TR, we have that 

st( *L(f, dx, * [a, b})) = Bt( *L(f, dx, * [a, c])) + st( *L(f, dx, * [c, b})) = 

st(* U(f,dx, *[a,c])) + st(* U(f,dx, *[c,b])) = st(*U(f,dx, >,&])). 

But, st{*U{f,dx, *[a,c]))-et(*L(f,dx, *[a,c])) > 0, st(*U{f,dx, *[c,b]))-st{*L(f,dx, *[c,b])) > 
imply that st{*U{f,dx,*[a,c})) = st( *L(f, dx, * [a, c])) and st( *U(f, dx, * [c, b])) = 
st( *L(f, dx, * [c, b})) and the result follows. 

(ii) This follows by considering the same type of simple partition as in (i) and applying the 
standard part operator and the proof is complete. | 

Now let's consider a few of the most significant properties of the integral. 
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Theorem 13.19. Let f f dx G IR. For each y G [a, b], let F(y) = J v f dx. Then F is uniformly 
continuous on [a,b\. If f is continuous at c G [a, b], then F'(c) = /(c). 

Proof. From Theorem 13.18, F(y) G M. Thus, F is a function from [a,b] into E. Since / is 
bounded and again by applying Theorem 13.18, it follows that for any x, y G [a, b], \F(y) — F(x)\ < 
M\y — x\, where \ f{z)\ < M, Vz G [a, b]. Hence, by *-transform, if p, q G *[o, 6] and p — q G /x(0), 
then | - *-F(<z)| < M\p — q\ G /x(0) implies by Theorem 10.5 that F is uniformly continuous 

on [a, b]. 

Assume that / is continuous at c G [a, b]. Now considering the integral as the function F(y) = 
f dx, y G [a, b], our previous integral properties can by translated into properties about F, where 
for z, y G [a, b], f dx — F(y) — F(z). By *-transform, these *F function properties are relative to 
z,y G * [a, b]. Let p G n(c) such that p + c G * [a, 6]. First, assume that p < c. From the continuity of 
/; I *f(p) ~ /( c )l = 7 € ^(0)- Let <;(a;) = /(x) — /(c). Then from the hyper-properties for *G(x), wc 
have that G(c)- *G{p) = F(c)- *F(p)-f(c)(c-p). Moreover, \F(c)- * F(p)- f(c)(c-p)\ < j(c-p). 
Consequently, 

F(c) - *F(p) 



G M (0). 



c — p 

In like manner, for the case that p > c, and the result that F'(c) = /(c) follows. | 

Corollary 13.20. // f^fdx G B, p,g G *[a,6] and p - q e /u(0), t/ien *F(p) - *F(q) = 

*^7^e M (0). 

It's beyond the scope of this book to establish a necessary and sufficient for f dx to exist. The 
facts are that there are some very unusual functions that are integrable. For example, consider the 
non-negative rational numbers (in lowest form) q/p, p > 0. Define on [0, 1] the function f(p/q) = 
and, for each irrational r, f(r) = r. Then Jq 1 f dx exists. I leave it to the reader to find the exact 
value. It is rather easy however, to use our methods to show that the value of the integral is 
independent from the value of the bounded function at finitely many points in [a, b]. 

Theorem 13.21. Let f and g be bounded on [a,b] and there exists a non-empty finite set 
of numbers D = {po, . . . ,p n } C [a, b] such that f and g only differ on D. If J a f dx G TR, then 
$ b af dx = $ b a9dx. 

Proof. Consider any dx. Without loss of generality, we may assume that, for [c, d] G [a, b], d, 
that / and g differ at most at the end points {c,d}. Then *L(f,dx) = mo dx + J2i midx + 
toa-1 dx + m\(b — xa). Hence, st( *L(f, dx)) — st(^ 1 midx) = st(*L(g,dx)). In like manner, 
st(*U(f,dx)) = st(*U(g,dx)) and the result follows! 
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14. WHAT DOES THE INTEGRAL MEASURE? 

In this chapter, I present a slightly advanced look at the type of physical properties that the 
integral will measure. Also I continue to assume that / is bounded on [a,b]. 

Definition 14.1 (Additive Function.) A function B: [a, b] x [a, b] — > M is additive if for each 
simple partition P(Ax) = {a = xq < ■ ■ ■ < x n < x n+ i = b} 

B(xi,x i+1 ) = B(xi,x) + B(x,x i+ i), Xi<x<x i+1 , i = 0,...,n. 

(This is not the only definition in the literature for this type of additive function.) 

I note that in general B(xi,Xi) = 0. Of course, it's immediate that if f^fdx G 3R, then 
B(x, y) = F{y) — F{x) = f^fdx is additive on [a, b]. But, does the converse hold? If you are given 
a specific additive function B, then * B has meaning for any fine partition generated by dx since the 
definition of B is relative to the partition points and subdivision closed intervals. 

Definition 14.2. (Admissible for /.) A function B that is additive on [a, b] is admissible 

for /: [a, b] — > TR iff there exists some dx and for the fine partition P(dx) — {a — x < ■ ■ ■ < xa < 
x\+i = b} it generates, for each i = 0, . . . , A — 1, where b — xa — 0, there exists some pi G [x i7 x i+ i] 
such that 

* B{Xi d T' +l) * f{pi) e m(0) ' {ic) 

and if b — xa ^ 0, then there also exists some pa € [xa, b] such that 

7(pa) € MO). (ic) 

b - xa 

Theorem 14.3. Let B be admissible for /: [a, b] — > M. Then for each r G TR + , there exist dx, dy 
such that 

-r(b -a)+ *L(f, dy) < B(a, b) < * Iff, dx) + r(b - a). 

Proof. Let r € 1R + . Assume that for each dx, 

B(a,b)> *U(f,dx)= *U(f,dx)+r(b-a). (14.4) 

I make the following observation about B where B(a, b) > U(f + r, Ax). Let n > 1, and B(a, b) = 
(Y^o 1 x i+i + Ax)j +B(x n , b). Then there exists some k G [0, n— 1] such that B(xk, sj+Az) > 
MfcAx and if b — x n ^ (or n = 1), then B(x n ,b) > M n (b ~ x n ), where Mj = sup{.f(x) + r \ 
x G \xi,xi + Ax]} which exists by boundedness. Thus, by ""-transform and assuming that (14.4) 
holds, we have that there exists k G [0, A — 1], * B(x k ,x k+ i) > M k dx and if b — xa ^ 0, then 
B{x n , A) > Ma(6 — xa), where Mj = sup{/(a;) + r | x G [x^Xj + dx]}, which also all exist from 
boundedness and I need not consider the * sup since by definition the * sup — sup . Consequently, 
for each p G [x k ,Xk+i], *B(x k , x k +i) > (*.f(p) + r)dx, k G [0, A — 1] and if p G [xA,b], then, 
*B(xA,b) > ( *f(p) +r)(b — xa), where b - xa > 0. This implies that for each dx, k G [0, A - 1], 

*B(x k ,x k+1 ) 

- - fiP) > r, Vp G [x k ,x k+1 



dx 

and if 6 — xa ^ 0, then 



* B{XA,b) -*f(p)>r, V P e[xA,b}. 



b xa 
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This contradicts admissibility. Thus, there exists some dx such that B(a,b) < * U(f + r, dx). In like 
manner, there exists some dy such that — r(b — a) + *L(f, dy) < B(a, b) and this completes the proof. 
I 

Theorem 14.5. If B is admissible for integrable f, then B(a,b) = J f dx. 

Proof. Let r e B + . Then, from Theorem 14.3, there exists dx,dy such that — r(b — a) + 
*L(f, dy) < B(a, b) < *U(f, dx) + r(b — a). The result follows by taking the standard part operator 
and the fact that r is arbitrary. | 

Thus, the integral can be used to calculate the values of an admissible additive function. How- 
ever, the converse of Theorem 14.5 does not hold. Indeed, due the fact there are many unusual 
integrable functions, the converse does not hold where you define the additive function by the inte- 
gral itself. In the rather simple example below, it's shown that there are additive functions, indeed 
integrals, where (7(7) holds but not for all dx. 

Example 14.6. Define the integrable function f(x) = 0, Vi g [0,1), f(x) = 1, M x g [1,2]. 
Define B(x, y) = JJ / dx for all x, y £ [0, 2]. Then B(x, y) is additive on [0, 2]. Let A g M^, and let 
dx = 2/A and A by *-even. There exists some k g [0, A — 1] such that for each p E *[0, 2], p < 
x k, *f{p) = and p > Xk, */(p) = 1- We also know that for each k g [0, A — 1], m k dx < 
*B(x kj x k +i) < M k dx, where m k = in£{*f(x) \ x g [x k ,x k +i}}, M k = sup{ *f(x) \ x g [x k ,x k+1 ]}. 
Consequently, we have that for j g [0, A — 1], j < k 

*B{ Xj , Xj+l )_ =Q= , /(p)) Vpe[ j 



and for each j > k 

Thus, B is admissible. Now let dy — 2/T, but T is a *-odd number. Then again we have that 
B(x,y) = / dx — ^ f dy. However, there exists i e [0,T — 1] such that 1 is the midpoint of 
Xi,Xi+i and *B( X i, Xi+i) = dy/2. From, this is follows that the (IC) does not hold for this dy. 

The point x = 1 in the above example is a point of discontinuity for /. If you altered the 
definition of admissibility to have the (7(7) holds for all dx and for all pi g [x i7 x i+ i] you get a notion 
I called superncarness. I show in Herrmann (1993), that an additive function B is supernear to / 
iff / is continuous. And, of course, the B is equal to the integral. Let's complete this chapter by 
considering an additional property for an additive function, a property that models various geometric 
and physical notions. 

Definition 14.7. (Rectangular Property) An additive function A: [a,b] x [a, b] — ► TR, has 

the rectangular property for / iff for any c,d e [a, b],c < d, m(d — c) < A(c, d) < M(d — c), 
where, as usual, m — inf{/(a;) | x g [c, d]}, M = snp{f(x) \ x g [c, d]}. 

What does the addition of the rectangular property do for us? By ^-transform, consider dx. 
Then for each k g [0,a;A-i], m k dx < *A(x k , Xk+i] < M k dx, and if wa+i = 6,6 — x\ ^ 0, then 
?71a(6 — ita) < *^4(^A, 6) < Ma(6 — xa), where the m^, Mj are defined in the usual way. Consequently, 
in general, for such functions for each k g [0, A — 1], there is some g [xfe,a;fe + i such that 



M(x fe ,a; fe+ i) _ 



< M k ~m k 
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and if b — xa ^ 0, then there exists some pa g [x\, b] such that 

*A(x A ,b) 



b- x A 

This discussion leads to the following theorem 



- 7(p) 



< M A - m A . 



Theorem 14.8. Assume that A is additive and has the rectangular property for f and that there 
exists some dx such that for the simple partition P{dx), whenever k g [0, A— 1], then Mk—rrik g £t(0), 
where m k = inf{/(x) \ x g [x k , x k +i]}, M k = sup{/(x) | x g [x fe ,x fe+ i]}. Further, if b - x A ^ 0, 
then Ma — tua € m(0), tua = inf{/(x) | x € [xa,6]}, Ma = sup{/(x) | x g [xa,6]}. T/ien ^4 is 
admissible for f. 

Why is the usual application of the integral to functions that are piecewise continuous on [c, d]l 
Well, first of all, since the value of the integral is independent from the value of the function at the 
end points of the intervals of definition, then all that is needed is to consider why for a specific closed 
interval [a, b]. The next theorem shows why the result in Example 14.6 occurs. 

Theorem 14.8. A function f is continuous on [a, b] iff for each dx and, hence, each fine 
partition P(dx), whenever /eg [0, A — 1], it follows that M k — m k g /i(0) and if b — xa ^ 0, then 
Ma — tua g M0). 

Proof. Let / be continuous on [a, b]. The / is uniformly continuous. So, let's consider any dx 
and \p — q\ < dx, p,q g * [a, b]. Then p - q g ^(0) implies that *f(p) — *f(q) G m(0). Consider 
the simple partition P(dx). Since, for each h G [0, A — 1] there exist p,q g [x^, Xfc+i] such that 
M k = *f(p), m k = *f(q) as well as for the case that b — xa ^ 0, then the necessity follows. 

For the sufficiency, let p — q g /U(0), p,q g * [a, b]. Then there exists dx such that \p — q\ < dx. 
Consider a P{dx) fine partition. First, assume that for some k g [0, A — 1], p, q g [x^, x/c+i] or that 
p,g€ [xa, 6]. Then since M k — m, k g /u(0), */(p) ~ *./(?) G /u(0). If there does not exist some fc G [0, A] 
such that p,q G [xfe,Xfe+i], then p, g are in adjacent intervals by ^-transform of the standard case. 
So consider 2dx = dy and apply the first case argument to show that *f(p) — *f(q) G /u(0). Thus, / 
is (uniformly) continuous on [a, b] and the proof is complete. | 

Corollary 14.10. Let A be additive on [a,b] and have the rectangular property for f. If f is 
continuous on [a,b], then A(x,y) = f dx, x < y, x,y G [a, b] and the function A is unique. 

Thus, if you start with a geometric or physical property that is measured by an additive function 
A with the rectangular property for a continuous function /, then A is uniquely modeled by the 
integral. On the other hand, although the function / need not be continuous, if J / dx g IR, then 
for x < y, x, y g [a, b] the function A(x, y) — f^fdx is additive and has the rectangular property. 
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15. GENERALIZATIONS 

Much of what I've covered can be highly generalized. It should be obvious that this nonstandard 
approach, although very restricted in the language used, is not depended upon the codomain of the 
set of sequences used to obtain the equivalence classes with respect to a free ultrafilter. Hence, 
the set TR and all the additional ones used in M can be replaced with the set XU1, where X is 
non-empty. The *-transform method holds and many of the general results, such as Theorem 3.2, 
follow since they are all obtained from the properties of the ultrafilter. It would be better to have 
a stronger language and a structure where we can use G over variables. But, even in our restricted 
language, much can be done. I give just a few brief example. 

Definition 15.1. (Real Metric Space.) A nonempty set X is called a metric space iff 

there exists a function d: X x X — > TR such that for each x,y,z G X, 

(i) d(x,y) = d(y,x) > 0, 

(ii) d(x, y) = 0iftx = y, 

(hi) d(x, y) < d(x, z) + d(z, y). 

Definition 15.2. (General Finite Points and Monads.) Any q E *X is finite iff there 
is some p G X such that *d(q,p) G G(0). For each p G X, the monad of p is /i(p) = {x | (x G 
*X) A (*d(x,p) G /u(0))} = {x | (x G *X) AVr((r ^ 0) A (r G m) -» *d(x,p) < \r\)}. The set 
ns ( * x ) = U{m(p) I P € X}. 

In this more general case, what was previously the set G(0) is now denoted by fin( *X), the set 
of all finite points in *X. And, as before, ns(*A) C fin(*A). These sets are equal in the case that 
X = TR, but for metric spaces in general they are not equal. 

A closed sphere about pel, S[p, r] = {x \ d(x,p) < r}. A set B C A, for the metric space 
(X, d), is bounded iff there is some closed sphere S\p, r] such that B C S[p,r]. Now I assume that 
the theorem on the *-transform has been established for our structure. 

Theorem 15.3. For the metric space (X,d), B C X is bounded iff *B C fin(*A). 

Proof. For the necessity, the sentence Vx((a; G S\p, r] — ► d(x,p) < r) holds in A4; and, hence, 
in *M. Thus, by ^-transform, Vx((x G *(S\p,r]) -» *d(x,p) < r). Consequently, *B C *(S\p,r]) G 
fin(*A). 

For the sufficiency assume that B C X is not bounded. Let p G X. Then the sentence 
Vx((a; G JR + ) — > 3y((y G B)A(d(p,y) > x)) holds in M. Thus, by ^-transform, letting A G IR+ , then 
there exists some q G *B such that *d(q,p) > A. Now let p' G X. Then *d(q,p') cannot be a finite 
*-real number. For if we assume that *d(q,p') G G(0), then since d(p,p') G G(0) we would have that 
*d(p,q) < d(p,p') + *d(p',q) G G(0); a contradiction. Hence, q £ *(S\p',r]) for any r G TR + and any 
p G X. This completes the proof. | 

Corollary 15.4. A sequence S: M — > A is bounded iff *5(A) G fin( *A) /or eac/i A G IN^. 

The following results are obtained immediately in the same manner as the corresponding real 
number results. 

Theorem 15.5. For a metric space (X,d), a sequence S: M — > A converges to L iff *S(A) G 
/x(L) /or eac/i A G M^. 

Theorem 15.6. For a metric space, every convergent sequence is bounded. 
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Theorem 15.7. A point p G X is an accumulation point for a sequence S: M — > X iff there 
exists some A G INqo such that *S(A) G n(p)- 

Corollary 15.8. A sequence 5: IN — > X /ias a convergent subsequence iff there exists some 
A G INoo such that *S(A) G ns(*X). 

Theorem 15.9. A sequence S: W -> X is Cauchy iff *d(*S(A), *S(fl)) G fi(0) for each A, Q G 

It's possible for metric space, including the real numbers, to define monads at points q G *X — X 
by letting fi(q) = {x \ *d(x,q) G /u(0)}. Then it follows that 5: 3N — > A is Cauchy iff there exists 
some q £ *X such that *5(A) G /x(q') , VA G BJoo- One of the most significant metric spaces is the 
normed linear (vector) space. I consider a linear space over the real numbers for my example. 
If V is a linear space over the real numbers, then a norm is a map || • ||: V — > IR with the properties 
that, for each x, y G V, (i) > 0. (ii) For each re 1, ||rx| = \r\ \\x\\. (iii) \\x + y\\ < \\x\\ + \\y\\. 

The metric is defined by letting d(x, y) = \\x — y\\. Then you now apply nonstandard analysis to 
this space along with its additional linear space properties. For example, we have that fi(p) = {p+7 | 
7 G /i(0)} for such a metric space, in general. Nonstandard analysis has been applied extensively 
to linear spaces. For the major generalization known as the topological spaces, where I have 
established some immediately of the original results, we need a structure more directly related to 
set-theory and such an appropriate structure is not what I would consider as elementary in character. 
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APPENDIX 

Theorem Al. Let T be any filter on X. Then there exists an ultrajilterUx D ? '■ 

Proof. You can either use the set-theoretic axiom that states that this statement holds; or use 
Zorn's Lemma, which is equivalent to the Axiom of Choice. Let Q be the set of all filters that contain 
T . Suppose that C is a chain with respect to C in Q. I show that (JC is a filter that obviously would 
be a upper bound for this chain and is contained in Q. Clearly, ^ [JC. Let A G \JC. Then 
for some T\ G [JC. Hence, if A C B, then Bgfj implies that B G (JC. Now let A, B G \JC. Since 
Q is a chain with respect to C, then there is some JF 2 G Q such that A, B G JF 2 . Hence, Ap\ B E ^ 
implies that (J C is a filter that contains T . By Zorn's Lemma, there is a member of Q that is a 
maximal member U with respect to C . If U is not an ultrafiltcr, then there would be a filter U\ not 
equal to U such that U C U\. But, then U\ G 5; a contradiction of the (c) maximal property for 
This completes this proof. | 

I assume that the reader knows what I mean by a first-order language L with equality, where 
the constants represent objects in IR, IR 2 , . . ., and V(TR), V(TR 2 ), .... Equality is interpreted to be the 
identity on IR or sct-thcorctic equality elsewhere. The variables are denoted by Roman font. The 
first class of atomic formula are x G Y and, for n > 1, (x\, . . . ,x n ) G Y, where Y is a constant, 
and all possible permutations of members of the n-tuples, where Xk is either a constant or variable. 
I leave to the reader the trivial cases where the various expressions only contain constants and 
assume that, for all other formula, at least, one of the symbols that can differ from a constant is a 
variable. (The Y includes the +,-,<. Our result below holds for many other collections of atomic 
formula that describe members of more comprehensive structures, but I don't use them for this 
monograph.) Finally, a — b, where a, b are both variables or, at most, one is a constant. (Note: the 
symbols = is interpreted as a special binary relation within our language). Only a special set K of 
formula built from these atomic formula is used. Further, P G K if and only if P has only bounded 
quantifiers. That is each quantifier contained in the P is restricted to subsets of IR or IR™. Indeed, in 
most cases, a P with bounded quantifiers is usually equivalent to a form Vx((x G X) —>■■■) or the 
form 3x((x G X) A • • •). The reason I'm using bounded quantifiers is that the *-transform (Leibniz) 
property for such formula can be established without the Axiom of Choice. Given any P, then * P is 
obtained by placing every constant A in P by *A. (Note: The +, •, < are constants that technically 
should carry the * notation, but it's customary to drop this notation when the context is known.) 
In what follows, (a) = a and x G IR is considered in two context, either a constant for a member of 
IR or as varying over a subset of IR as the case may be. Let K be our set of formula and 

M = (TR,. .. , IR™, . . . , V(TR), V(TR n ), ...,+,-,<) 

*M = (*1R, . . . , *m", . . . ,V(*TR), . . .,V(*TR n ), ...,+,-,<). 

Theorem A2. Let P(x\, . . . ,x p ) G K contain at least one variable and X is a member of M. 
Define A = {(x\, ...,x p ) \ ((x\, . . . , x p ) G X) A (P holds in M)}. Then 

*A = {( Xl ,...,x p ) | {{xi, . . . , x p ) G *A)A(*Pholds in *M)}. 

Proof. Let P = (x G Y), where Y C H". Then A = {x | (x G X) A (x G Y)} = {x \ x G 
X} n {x | x G Y}. By Theorem 3.2 (vi)(xi), *A= *X n *Y = {x | (x G *X) A (x G *Y)}. Let 
P = (x = y), (or P = (x = x)). Let X C H, X x X = Y and A = {(x, y) \ ((x, y) G Y A (x = y)}. 
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The result follows from *X x *X = *Y and if a, b G X and a = 6, then [A] = [B] . For the case that 
X C B™, n > 1, we need either to identify X x X with the obvious 2n-ary relation or we need to 
extend the structure to include such objects and extend the results in Theorem 3.2 to cover these 
objects. This also depends upon your definition of the n-tuple. The case where x = a or a — x also 
follows in like manner. Note that since function or term symbols are not used in the language, then 
the = can be considered as used to generate specific relations that are elements of our structure. 

For atomic formula (xi . . . x p ) G Y, the result follows by application of Theorem 3.2 and the 
definition of *Y noting, of course, that (xi, . . . , x p ) G X. 

Since every first-order formula is equivalent to a formula that has all of the quantifiers to the 
left of a formula that contains no quantifiers but only formula built from atomic formula and the 
connectives A, V, —>,<-», -i. As for the connectives, it is well know that if we assume that the result 
holds for quantifier free formula V and W, then all we need to do to show, by induction, that the 
result holds in general for quantifier free formula is to show that it holds for V A W and for -V. 
For all but the atomic formula (xi, . . . ,x p ) G Y this is immediate by Theorem 3.2. Now let V be 
the expression (xi, . . . ,x p ) G Y and *A = {(xi, ...,x p ) | (xi, . . . , x p ) G * X) A ((xi, . . . ,x p ) G *Y)}. 
Then consider * B = {(xi, . . . , x p ) | (xi, . . . , x p ) G * X) A ((xi, . . . , x p ) *Y)}. The *-transfer holds 
since * B = *X — * A from Theorem 3.2. 

Now let V = (xi, . . . , x p , yi, . . . , y q ) G Y, W — (xi, . . . ,x p , zi, . . . ,z r ) G Z, where I assume 
the possibility that both V and W contain xi , . . . , x p and the other constants or variables are 
distinct from these. Let (xi, . . . , x p , yi, . . . , y q , z\, . . . , z r ) — (xi, . . . , z r ). Let B — {x\, . . . , z r ) \ 
(xi, ...,z r ) G X) A ((xi,. . .,x p ,yi, ...,y q ) G Y)} and C = {xi, ...,z r ) \ (xi, ...,z r ) G X) A 
((xi,.. .,x p ,z u . .. ,z r ) G Z)}. Then A = {(x U ---,z r ) \ {x u ...,z r ) G X) A (V A W)} = B n C. The 
result holds from the induction hypothesis, in this case, since *A = * B n *C. 

As mentioned, any first-order formula is equivalent to one which can be written as V = 
(qx n+ i) ■ ■ ■ (qxi)W, where xi, . . . , x„ + i are free variables in W and IF is a finite combination 
via A and -i of all of our quantifier free atomic formula. Hence, represent this formula by 
W(yi, . . . , y p , xi, . . . , x„ + i). We can always assume that ((/x„ + i)V = (3x n+ i)V (for if not, con- 
sider -tV) and V is also in this special quantifier form. If n = 0, then the result has been es- 
tablished. Assume the result holds for an appropriate member of K with the number of quan- 
tifiers < n. Under our requirements, x n+ \ is restricted to a member Z of our structure. Let 
D = {(t/i, . . .,y p ),x n+1 ) | ((yi, . . .,y p ),x n+ i) G X x Z A (qx n . ..qxi)W}, where I C B p is also 
in the standard structure. Then, by induction, and Theorem 3.2, *D = {(yi, . ■ ■ , y p ), x n+ \) \ 
((y 1 ,...,y p ),x n+1 ) G *X x *Z A (qx n . . .qx 1 )*W}. Let A = {(yi,...,y p ) \ {(yi,...,y p ) G 
X) A (3x„+i((x„+i G Z) A (qx n . . . qx\)W))}. Using this and a simple modification of the proof of 
Theorem 3.2 (x), it follows that the domain of * D = *A, where *A = {(yi, . . . , y p ) \ ((j/i, • • • , y p ) G 
*X) A (3x„ +1 ((x„ +1 G *Z) A (qx n ...qx 1 )*W))} = {(yi,...,y p ) \ ((y u ...,y p ) G * X) A *V}. By 
induction this completes the proof. | 

Theorem A3. Let V G K be a sentence with necessary quantifiers or be compose only of 
connected atomic formula expressed only in constants. Then V holds in M. iff *V holds in *M. 

Proof. Since V is a sentence it has no free variables. If V contains no quantifiers, then V only 
contains constants and the result follows from the definition of the hyper-extension and Theorem 
3.2. 

Now assume that V contains quantifiers and that it is written in the equivalent form (prenex 
normal form) V = (qx n ■ ■ ■ qx\)W and qx n = 3x„. (If this is not the case, consider the negation.) To 
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say that V holds in M. means that A = {x n \ (x n G X) A ((gx„_i • • • qxi)W holds in M)} 7^ 0, 
where X is the domain for 3x„. But A ^ iff = *0 = *A = {x„ | (x„ € * X) A 
((gx n -i • • • gxi) *W holds in *M)}. This completes the proof. | 
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